Czkawka making my CPU earn its living by finding similar images in a folder with thousands of duplicates. Love this tool, it's a must for everyone in this sub. Thank you, /u/krutkrutrar!
I've removed at least 100GB of duplicates using this tool. Running every few months is a good way to cleanup things when my download queues get unruly.
Czkawka, dupeGuru, and VisiPics are my go-to for non-exact photo duplicates. I typically run all three, since they don't all find the same duplicates. VisiPics runs fine under wine.
But Czkawka's real strength is non-exact video duplicates. The only other tool I've found that does that is videoduplicatefinder.
I need similar tool but for music. Name of the files might be different. Size might be slightly different as well but im not sure that its count as duplicates
Just tried this! It's either so efficient my CPU isn't working too hard or it focuses most of its work onto one CPU core. That said, I appreciate being able to identify the highest res version of a file, that helps a lot!
There seem to be a few of these in Github, found this one recently it seems to do a reasonable job. The real problem is its bash script you need WSL2 to run it on windows.
What I like is the output is CSV files that are spreadsheet friendly and can be used to analyze and remove files in bulk.
The real problem is its bash script you need WSL2 to run it on windows.
Eh?
WSL2 is one way to run a Linux kernel (and thus native Linux executable binaries) in Windows.
And while bash is definitely very common on Linux, it has never by any means been a strictly Linux program.
It can be used on all kinds of operating systems -- mostly unix-like operating systems, but also including Windows using a POSIX compatibility shim like Cygwin.
People were using bash in 1989, years before Linux became the beginning of a thing. And folks have been using it on Windows since at least 1995, or maybe even earlier -- decades before WSL.