Skip to content

Foldseek

超高速蛋白质结构搜索工具,通过将三维结构编码为一维的 3Di 字母序列实现比传统方法快数万倍的结构相似性搜索。 该工具支持对大型结构数据库进行快速检索,是蛋白质功能注释、同源发现和结构聚类的重要利器。

PropertyValue
Purpose超快速蛋白质三维结构相似性搜索
Time ComplexityO(n)
Space ComplexityO(n)
Year2023
CategoryProtein Structure Prediction

Complexity Analysis

  • Time Complexity: O(n)
  • Space Complexity: O(n)

Performance Insight: The time complexity of this algorithm is linear (O(n)), scales linearly to TB-scale data and is suitable for streaming pipelines. Linear space can often be reduced by constant factors via sliding-window techniques.

Note: Complexity analysis is based on theoretical models. Actual runtime is affected by input scale, hardware, and implementation optimizations. Benchmark for your specific workload.

Literature & Implementation

TM-align · DALI · MMseqs2

Tags

structure-search fast 3Di structural-alignment

Released under the MIT License.