Answer

Detecting and Removing Redundancy in file

meena moon

792

1 Algorithms in C#

I have a very large dataset ( integer data ) in file .

I would like to search for duplicates data (int value) and then remove them from file in a rapidly way.

What would be a good algorithm for this ??

I'm reading about minhash algorithm. Is it a good way for this purpose? or is there another way??

Forum Statistics

Please welcome our newest member organizemeinc .
3,115,167 users have contributed to 147,332 threads and 483,920
In the past 24 hours, we have 0 new threads, 0 new posts, and 27 new users.
In last week, the most popular thread is 'Do You Use OpenAI More or Claude More for Daily Development?'.

Upcoming Events

View all

Our Training Programs