Skip to content

Diversity Analysis

Comprehensive guide to microbial community diversity analysis in MICOS-2024.


Overview

Diversity analysis measures the richness (number of taxa) and evenness (distribution of abundances) of microbial communities. These metrics provide insights into:

  • Community health: Higher diversity often associated with stability
  • Treatment effects: Changes in diversity under different conditions
  • Ecological patterns: Spatial and temporal variation
  • Comparative studies: Differences between ecosystems

Types of Diversity

Alpha Diversity (Within-Sample)

Measures diversity within individual samples:

MetricWhat it MeasuresBest Used For
RichnessNumber of taxaCommunity complexity
ShannonRichness + EvennessGeneral diversity
SimpsonDominanceDetecting dominance
Faith's PDPhylogenetic diversityEvolutionary breadth

Beta Diversity (Between-Samples)

Measures dissimilarity between samples:

MetricWeightingBest Used For
Bray-CurtisAbundanceCommunity composition
JaccardPresence/AbsenceSpecies overlap
UniFracPhylogeneticEvolutionary turnover
AitchisonCompositionZero-inflated data

Input Requirements

Data Format

FormatDescriptionSource
BIOMStandard format for microbiome dataKraken-biom, QIIME2
TSVTab-delimited feature tableCustom tables
QZAQIIME2 artifactQIIME2 exports

Metadata Requirements

ColumnDescriptionRequired For
sample-idUnique identifierAll analyses
groupExperimental groupGroup comparisons
subject-idSubject identifierPaired/longitudinal
time-pointTime of collectionLongitudinal

Running the Analysis

Option 1: MICOS CLI

bash
# Diversity analysis from BIOM file
python -m micos.cli run diversity-analysis \
  --input-biom results/taxonomic_profiling/feature-table.biom \
  --output-dir results/diversity_analysis \
  --metadata metadata.tsv

# As part of full pipeline
python -m micos.cli full-run \
  --input-dir data/raw_input \
  --results-dir results \
  --threads 16 \
  --kneaddata-db /db/kneaddata \
  --kraken2-db /db/kraken2

Option 2: Direct QIIME2

bash
# Import BIOM to QIIME2
qiime tools import \
  --input-path feature-table.biom \
  --type 'FeatureTable[Frequency]' \
  --output-path table.qza

# Rarefy table
qiime feature-table rarefy \
  --i-table table.qza \
  --p-sampling-depth 10000 \
  --o-rarefied-table table-rarefied.qza

Alpha Diversity

Metrics Overview

Richness Estimators

MetricDescriptionInterpretation
Observed FeaturesRaw count of taxaSimple richness
Chao1Estimated total richnessAccounts for unobserved taxa

Diversity Indices

MetricFormulaRangeNotes
Shannon-Σ(pᵢ × ln(pᵢ))0 to ~7Accounts for richness and evenness
Simpson1 - Σ(pᵢ²)0 to 1Probability two random reads are different

Evenness Measures

MetricDescriptionRange
Pielou's JShannon / ln(S)0-1

Implementation

bash
# Calculate alpha diversity
qiime diversity alpha \
  --i-table table.qza \
  --p-metric shannon \
  --o-alpha-diversity shannon.qza

# Multiple metrics at once
qiime diversity alpha-rarefaction \
  --i-table table.qza \
  --p-metrics shannon \
  --p-metrics chao1 \
  --p-metrics observed_features \
  --p-min-depth 1000 \
  --p-max-depth 50000 \
  --m-metadata-file metadata.tsv \
  --o-visualization alpha-rarefaction.qzv

Beta Diversity

Distance Metrics

Compositional Metrics

MetricTypeFormula Characteristics
Bray-CurtisAbundance-basedD = Σ|Aᵢ - Bᵢ| / Σ(Aᵢ + Bᵢ)
JaccardBinaryD = 1 - (|A ∩ B| / |A ∪ B|)

Phylogenetic Metrics

MetricWeightingSensitive To
Unweighted UniFracPresence/absencePhylogenetic novelty
Weighted UniFracAbundancePhylogenetic turnover

Implementation

bash
# Calculate beta diversity
qiime diversity beta \
  --i-table table.qza \
  --p-metric braycurtis \
  --o-distance-matrix braycurtis.qza

# PCoA
qiime diversity pcoa \
  --i-distance-matrix braycurtis.qza \
  --o-pcoa braycurtis-pcoa.qza

PERMANOVA

Tests if groups differ in multivariate space:

bash
qiime diversity beta-group-significance \
  --i-distance-matrix braycurtis.qza \
  --m-metadata-file metadata.tsv \
  --m-metadata-column group \
  --p-method permanova \
  --o-visualization braycurtis-permanova.qzv

Interpretation:

  • p < 0.05: Significant difference between groups
  • : Proportion of variance explained by grouping

Interpretation Guidelines

Alpha Diversity - Typical Values (Human Gut)

MetricRangeNotes
Observed features50-200Varies with sampling depth
Shannon2.5-4.5>4 indicates high diversity
Chao1100-400Estimate of total richness
Pielou's J0.6-0.9>0.8 indicates even distribution

Beta Diversity - PCoA Interpretation

PatternInterpretation
Tight clusters by groupStrong group effect
Overlapping clustersSimilar communities
Gradient patternContinuous environmental driver
OutliersUnique community composition

See Also

MICOS-2024 whitepaper for reproducible metagenomics engineering.