
Ankh

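A protein language model built on an optimized T5 architecture. It uses a more efficient pre-training strategy and data augmentation to reach performance comparable to ESM-2 on multiple downstream tasks, with fewer parameters and faster inference.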

Property          Value
Purpose           Efficient, lightweight protein language model
Time Complexity   O(n^2 * d)
Space Complexity  O(n^2)
Year              2023
Difficulty        Intermediate
Languages         Python
Category          Protein Language Model

Complexity Analysis

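  • Time Complexity: O(n^2 * d), where n is the sequence length (number of residues) and d is the hidden dimension
  • Space Complexity: O(n^2), for the n × n attention matrix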
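
The two bounds above follow from the attention step of a Transformer. The sketch below is a generic single-head scaled dot-product attention in plain Python, written with explicit loops so the n × n score matrix is visible. It is not Ankh's actual implementation: learned projections, multiple heads, and T5's relative position bias are all omitted, and `self_attention` is a hypothetical name used only for illustration.

```python
import math


def self_attention(x):
    """Single-head self-attention over x, a list of n vectors of length d.

    Simplifying assumption: queries, keys and values are all x itself
    (no learned projection matrices), so only the attention cost remains.
    """
    n, d = len(x), len(x[0])

    # Score matrix: n * n entries, each a d-term dot product -> O(n^2 * d) time.
    # Holding all n * n entries at once is the O(n^2) space term.
    scores = [
        [sum(x[i][k] * x[j][k] for k in range(d)) / math.sqrt(d) for j in range(n)]
        for i in range(n)
    ]

    # Row-wise softmax, shifted by the row maximum for numerical stability.
    weights = []
    for row in scores:
        m = max(row)
        exps = [math.exp(s - m) for s in row]
        total = sum(exps)
        weights.append([e / total for e in exps])

    # Weighted sum of value vectors: another O(n^2 * d) multiply-adds.
    out = [
        [sum(weights[i][j] * x[j][k] for j in range(n)) for k in range(d)]
        for i in range(n)
    ]
    return out, weights


# Toy input: n = 3 residues embedded in d = 2 dimensions (values are arbitrary).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out, weights = self_attention(x)
print(len(weights), len(weights[0]))  # dimensions of the n x n attention matrix
```

Doubling the sequence length quadruples both the number of dot products and the size of `weights`, which is where the quadratic terms come from.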

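Performance Insight: Time is polynomial, growing quadratically with sequence length n. Space is also quadratic because the full n × n attention matrix is held in memory, so very long sequences are costly. Hirschberg-style space optimization targets alignment dynamic programming and does not apply to attention; for long sequences, consider truncating or chunking the input, or using a memory-efficient attention implementation.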
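
To get a feel for how quickly the quadratic space term grows, here is a rough back-of-the-envelope estimate. The helper `attention_matrix_bytes` is hypothetical, and its defaults (one head, 4-byte float32 values) are illustrative assumptions rather than Ankh's actual configuration.

```python
def attention_matrix_bytes(seq_len, num_heads=1, bytes_per_value=4):
    """Bytes needed to hold the full attention matrices of one layer.

    Each head stores seq_len * seq_len scores. num_heads and bytes_per_value
    are illustrative defaults (float32), not Ankh's actual configuration.
    """
    return seq_len * seq_len * num_heads * bytes_per_value


for n in (256, 1024, 4096):
    mib = attention_matrix_bytes(n) / 2**20
    print(f"n={n:5d}: {mib:8.2f} MiB per head")
```

Each fourfold increase in sequence length multiplies the estimate by sixteen. The figure covers only the score matrices of a single layer; activations and model weights add to it.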

Note: Complexity analysis is based on theoretical models. Actual runtime is affected by input scale, hardware, and implementation optimizations. Benchmark for your specific workload.
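
One way to act on this advice is a small timing harness. The sketch below is an assumption-laden stand-in: `pairwise_scores` is a hypothetical pure-Python workload with the same O(n^2 * d) shape as attention, not a call into Ankh. In practice you would pass your own model's forward pass to `time_call` instead.

```python
import time


def time_call(fn, *args, repeats=3):
    """Return the best wall-clock time in seconds over `repeats` runs of fn(*args)."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best


def pairwise_scores(n, d):
    """Stand-in workload: n * n dot products of length d, i.e. O(n^2 * d) work."""
    x = [[(i + k) % 7 / 7.0 for k in range(d)] for i in range(n)]
    return [[sum(a * b for a, b in zip(x[i], x[j])) for j in range(n)] for i in range(n)]


# Time the workload at a few sequence lengths with d fixed at 16.
timings = {n: time_call(pairwise_scores, n, 16) for n in (32, 64, 128)}
for n, seconds in timings.items():
    print(f"n={n:4d}: {seconds * 1e3:.2f} ms")
```

Taking the best of several runs reduces noise from other processes. Measured ratios between sizes will only roughly track the theoretical factor of four per doubling, for the reasons the note above gives.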

Literature & Implementation

ESM-2 · ProtTrans · ProtBERT

Tags

language-model lightweight efficient t5-architecture

Released under the MIT License.