Foldseek
超高速蛋白质结构搜索工具,通过将三维结构编码为一维的 3Di 字母序列实现比传统方法快数万倍的结构相似性搜索。 该工具支持对大型结构数据库进行快速检索,是蛋白质功能注释、同源发现和结构聚类的重要利器。
| Property | Value |
|---|---|
| Purpose | 超快速蛋白质三维结构相似性搜索 |
| Time Complexity | O(n) |
| Space Complexity | O(n) |
| Year | 2023 |
| Category | Protein Structure Prediction |
Complexity Analysis
- Time Complexity:
O(n) - Space Complexity:
O(n)
Performance Insight: The time complexity of this algorithm is linear (O(n)), scales linearly to TB-scale data and is suitable for streaming pipelines. Linear space can often be reduced by constant factors via sliding-window techniques.
Note: Complexity analysis is based on theoretical models. Actual runtime is affected by input scale, hardware, and implementation optimizations. Benchmark for your specific workload.
Literature & Implementation
Related Tools
TM-align · DALI · MMseqs2