MegoBin

A modular framework for metagenomic binning

MegoBin is a modular research framework for metagenomic binning: the task of grouping DNA fragments from a mixed microbial sample back into the genomes they came from. It defines the stages of the binning pipeline (encoders, binners, and evaluators) as interchangeable components behind clean Python Protocol interfaces, configured with Hydra, so researchers in DEIS can swap a single piece, run reproducible experiments, and compare methods with minimal boilerplate.
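As a rough illustration of the component design described above, the sketch below shows how encoders, binners, and evaluators could be expressed as Python Protocol interfaces with small toy implementations plugged in. Every name here (Encoder, Binner, Evaluator, KmerEncoder, ArgmaxBinner, BinCountEvaluator, run_pipeline) is hypothetical and chosen for illustration only; it is not MegoBin's actual API, and Hydra configuration is left out to keep the example self-contained.

```python
from collections import Counter
from itertools import product
from typing import Protocol, Sequence, runtime_checkable


# Hypothetical interfaces: illustrative only, not MegoBin's real API.
@runtime_checkable
class Encoder(Protocol):
    def encode(self, contigs: Sequence[str]) -> list[list[float]]: ...


@runtime_checkable
class Binner(Protocol):
    def bin(self, embeddings: Sequence[Sequence[float]]) -> list[int]: ...


@runtime_checkable
class Evaluator(Protocol):
    def evaluate(self, labels: Sequence[int]) -> dict[str, float]: ...


class KmerEncoder:
    """Toy encoder: normalised k-mer frequency vector per contig."""

    def __init__(self, k: int = 1) -> None:
        self.k = k
        # Fixed k-mer order: lexicographic over the alphabet A, C, G, T.
        self.kmers = ["".join(p) for p in product("ACGT", repeat=k)]

    def encode(self, contigs: Sequence[str]) -> list[list[float]]:
        vectors = []
        for seq in contigs:
            counts = Counter(
                seq[i : i + self.k] for i in range(len(seq) - self.k + 1)
            )
            total = sum(counts[km] for km in self.kmers) or 1
            vectors.append([counts[km] / total for km in self.kmers])
        return vectors


class ArgmaxBinner:
    """Toy binner: assign each contig to the index of its largest feature."""

    def bin(self, embeddings: Sequence[Sequence[float]]) -> list[int]:
        return [max(range(len(v)), key=v.__getitem__) for v in embeddings]


class BinCountEvaluator:
    """Toy evaluator: report how many distinct bins were produced."""

    def evaluate(self, labels: Sequence[int]) -> dict[str, float]:
        return {"n_bins": float(len(set(labels)))}


def run_pipeline(
    encoder: Encoder, binner: Binner, evaluator: Evaluator, contigs: Sequence[str]
) -> tuple[list[int], dict[str, float]]:
    """Chain the three stages; any object matching a protocol can be swapped in."""
    labels = binner.bin(encoder.encode(contigs))
    return labels, evaluator.evaluate(labels)


labels, metrics = run_pipeline(
    KmerEncoder(k=1),
    ArgmaxBinner(),
    BinCountEvaluator(),
    ["AAAAAT", "AAATAA", "GGGGGC"],
)
```

Because the stages are matched structurally rather than by inheritance, replacing ArgmaxBinner with another class that has a compatible bin method requires no change to run_pipeline. In a Hydra-based setup, the choice of class for each stage would presumably come from configuration rather than being hard-coded as it is here.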

The longer-term aim is to let improved binners flow straight into the lab's bio-cloud pipeline (mmlong2) and be evaluated on data from complex microbial communities (MicroBench). MegoBin builds on ideas from CheckM2, SemiBin, and work revisiting k-mer profiles.