In my previous post I described a method for eliminating duplicate SmugMug images based on MD5s. This method does not catch all the duplicates.
If you uploaded the same image to different galleries at different times, the SmugMug MD5s may differ. I don’t know what SmugMug rolls into their MD5s, but I suspect it’s more than image data.
To get all the copies you must remove duplicate picture IDs, file names, and MD5s. Furthermore, to maintain a duplicate-free state, you need to check these items often. Now, after uploading or rearranging pictures, I update my SmugMug metadata and execute this J verb to ensure I don’t introduce duplicates.
CheckSmugDups=:3 : 0

NB.*CheckSmugDups v-- checks duplicate SmugMug images.
NB.
NB. monad:  CheckSmugDups uuIgnore

'albums images'=. readsmugtables 0
images=. }. images [ imhead=. 0 { images

NB. images should be unique in three ways:
r=. ,:'PID unique: ' ; # ~. (imhead i. <'PID') {"1 images
r=. r , 'MD5 unique: ' ; # ~. (imhead i. <'MD5') {"1 images
r=. r , 'FILENAME unique: ' ; # ~. (imhead i. <'FILENAME') {"1 images
if. 1 < # ~. ; {:"1 r do. smoutput 'duplicates present' end.
r
)
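The verb above only reports how many unique values each column holds, so it tells you that duplicates exist but not which ones. As a rough sketch of one way to list the offenders, the helper below returns every item that occurs more than once. The name dupitems and the sample file names are my own inventions for illustration and are not part of the linked code.

```j
NB. dupitems v-- items of a list that occur more than once, each listed once.
NB. hypothetical helper for illustration, not part of the linked script.
NB.
NB. monad:  dupitems bl
dupitems=: [: ~. ] #~ [: -. ~:

NB. made-up stand-in for one column of the images table
files=: 'a.jpg';'b.jpg';'a.jpg';'c.jpg';'b.jpg'

NB. ~: flags first occurrences, -. inverts the flags, #~ keeps the
NB. repeats and ~. reduces them to one copy of each
dupitems files
```

With the sample list this should return the two repeated names, a.jpg and b.jpg. Inside CheckSmugDups the same helper could presumably be applied to a real column, for example dupitems (imhead i. <'FILENAME') {"1 images, assuming imhead and images are still in scope.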
Any chance this duplicate finder is available to anyone (with instructions how to implement)?
Feel free to download the code – it’s linked in the post. The implementation language is not well known but the code is relatively easy to follow. There may be more mainstream solutions. Googling “JavaScript” and “SmugMug duplicates” would be a good place to start.