Primary runtime
CLI-first orchestration
The stable surface is a Click CLI backed by Python modules, with shell wrappers kept as compatibility layers.
Open-source metagenomics whitepaper
A documentation experience rebuilt as a technical brief: not a checklist of commands, but a guided map of how this repository turns raw sequencing input into reproducible, reviewable microbiome analysis outputs.
The site is optimized for demanding readers: interviewers, maintainers, and senior open-source engineers evaluating whether the project is coherent beyond its README.
Repository truth, execution boundaries, architecture intent, and research lineage.
The current stable CLI core, the broader workflow assets, and the gap between ambition and implemented runtime surface.
Start in Academy, then move into Architecture, then drop into Guides or Research depending on whether you are operating or auditing.
Primary runtime
The stable surface is a Click CLI backed by Python modules, with shell wrappers kept as compatibility layers.
Workflow posture
The repository carries step-level WDL assets, Singularity definitions, and a Docker Compose example for reproducible environments.
Reader outcome
This site is organized to let a reviewer understand scientific scope, software boundaries, and operational trade-offs quickly.
Pipeline narrative
MICOS-2024 is best understood as an analysis story with four checkpoints: input hygiene, taxonomic evidence, ecological interpretation, and report-facing outputs.
scripts/ as specialist tools. They matter, but they are not described as part of the same stability contract as the CLI core. Stage 01
FastQC and KneadData frame the entry gate, producing cleaner reads before downstream interpretation begins.
Stage 02
Kraken2, kraken-biom, and Krona turn cleaned reads into ranked taxonomic evidence and navigable summaries.
Stage 03
QIIME2 and associated metadata joins convert abundance tables into ecological signals and cohort comparisons.
Stage 04
Functional profiling and summarization consolidate pathways, annotations, and report-facing deliverables.
System anatomy
A reviewer should be able to map the docs to the codebase: entry commands, Python modules, workflow definitions, configuration templates, containers, and validation surfaces.
micos/cli.py exposes full-run, validate-config, and module-level commands for quality control, taxonomy, diversity, functional annotation, and summarization.
steps/, deploy/, and containers/ extend the platform into reproducible environments and step-level orchestration patterns.
The project benefits from established microbiome tooling, then attempts to wrap it into a more coherent end-to-end suite. That lineage is explicit in the Research section.
Execution chain
The runtime stack spans more than one abstraction level. This diagram makes the split explicit so contributors and reviewers can reason about where each concern lives.
Research grounding
MICOS-2024 is not a blank-slate invention. It is an integration effort sitting on top of well-cited microbiome tooling. That is a strength, and this site treats it as one.
Improved metagenomic analysis with Kraken 2
Open source / paper linkReproducible, interactive, scalable and extensible microbiome data science using QIIME 2
Open source / paper linkphyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data
Open source / paper link