X Tutup
The Wayback Machine - https://web.archive.org/web/20221108002105/https://github.com/kdeldycke/mail-deduplicate
Skip to content

kdeldycke/mail-deduplicate

develop
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Mail Deduplicate

Command-line tool to deduplicate mails from a set of boxes.

Stable release: Last release Python versions

Development: Unittests status Documentation Status Coverage Status

https://raw.githubusercontent.com/kdeldycke/mail-deduplicate/develop/docs/cli-coloured-header.png

Features

  • Duplicate detection based on cherry-picked and normalized mail headers.
  • Source and deduplicate mails from multiple sources.
  • Reads and writes to mbox, maildir, babyl, mh and mmdf formats.
  • Multiple duplicate selection strategies based on size, content, timestamp, file path or random choice.
  • Copy, move or delete the resulting set of mails after the deduplication.
  • Dry-run mode.
  • Protection against false-positives by checking for size and content differences.

Screenshots

https://raw.githubusercontent.com/kdeldycke/mail-deduplicate/develop/docs/cli-colored-help.png
https://raw.githubusercontent.com/kdeldycke/mail-deduplicate/develop/docs/cli-coloured-run.png

Installation

This package is available on PyPi, so you can install the latest stable release and its dependencies with a simple pip call:

$ pip install mail-deduplicate

Documentation

Docs are hosted on Read the Docs.

X Tutup