The Wayback Machine - https://web.archive.org/web/20200825001656/https://github.com/topics/duplicate-detection
Skip to content
#

duplicate-detection

Here are 117 public repositories matching this topic...

demystify

Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store them within a SQLite database; create additional columns to augment the output where useful; and query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists within digital preservation departments in memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.
  • Updated Jan 25, 2020
  • Python

Improve this page

Add a description, image, and links to the duplicate-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the duplicate-detection topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.