Skip to content

GeneMark-ES

自监督的真核和原核基因组基因预测工具,通过迭代训练概率模型自动识别编码区和基因边界。 该方法覆盖原核、真核和宏基因组等多种场景,是基因预测领域应用最广泛的算法家族之一。

PropertyValue
Purpose基于概率模型的多物种基因预测
Time ComplexityO(n)
Space ComplexityO(n)
Year2005
DifficultyIntermediate
LanguagesC
CategoryGene Prediction

Complexity Analysis

  • Time Complexity: O(n)
  • Space Complexity: O(n)

Performance Insight: The time complexity of this algorithm is linear (O(n)), scales linearly to TB-scale data and is suitable for streaming pipelines. Linear space can often be reduced by constant factors via sliding-window techniques.

Note: Complexity analysis is based on theoretical models. Actual runtime is affected by input scale, hardware, and implementation optimizations. Benchmark for your specific workload.

Literature & Implementation

AUGUSTUS · Prodigal · Glimmer

Tags

gene-prediction hmm prokaryotic self-training

Released under the MIT License.