Skip to content

Shasta

基于游程编码的超快速长读段组装工具,使用 run-length 编码压缩序列数据以加速组装过程。 该方法专为 Oxford Nanopore 长读段设计,能够在极短时间内完成大规模基因组的从头组装。

PropertyValue
Purpose超快速长读段基因组组装
Time ComplexityO(n)
Space ComplexityO(n)
Year2020
DifficultyIntermediate
LanguagesC++
CategorySequence Assembly

Complexity Analysis

  • Time Complexity: O(n)
  • Space Complexity: O(n)

Performance Insight: The time complexity of this algorithm is linear (O(n)), scales linearly to TB-scale data and is suitable for streaming pipelines. Linear space can often be reduced by constant factors via sliding-window techniques.

Note: Complexity analysis is based on theoretical models. Actual runtime is affected by input scale, hardware, and implementation optimizations. Benchmark for your specific workload.

Literature & Implementation

Flye · Canu · Miniasm

Tags

long-read fast run-length ont

Released under the MIT License.