Suspended
- Joined
- 17 Oct 2011
- Posts
- 5,707
- Location
- Buckingamshire
I have two datasets. Both sets contain files in a variety of formats, but all have Windows type file extensions.
There are files common to both sets, but there are also files unique to both sets. I'd like to be able to identify which files are common, but also which are unique.
What's the best way to go about this?
I had thought that hashing all the files would be a start, then I could possibly compare the hash values or is there a smarter way of doing this?
There are files common to both sets, but there are also files unique to both sets. I'd like to be able to identify which files are common, but also which are unique.
What's the best way to go about this?
I had thought that hashing all the files would be a start, then I could possibly compare the hash values or is there a smarter way of doing this?