Offline-First
Complete cleaning, deduplication, classification and export without cloud services. Perfect for local batch processing and long-term maintenance. Rule engine responds in sub-milliseconds.
Rules-first, ML-assisted, LLM-optional. Offline-ready browser bookmark organization tool.
Complete cleaning, deduplication, classification and export without cloud services. Perfect for local batch processing and long-term maintenance. Rule engine responds in sub-milliseconds.
Adjust rules, thresholds and directory organization via config.json and vocabulary files. No code changes needed. YAML vocabularies support controlled vocabulary and faceted classification.
Layer ML, semantic analysis and optional OpenAI-compatible LLM on top of rules. Automatic fallback when services are unavailable. No worries about service availability.
Export to HTML, Markdown, JSON and more. Supports browser re-import, knowledge base archiving, and further processing needs.
Provides cleanbook CLI tool and cleanbook-wizard interactive interface. Supports batch processing and automation integration.
Multi-level feature extraction based on domain, title, and URL. Fusion of rule engine and machine learning achieves 91.4% classification accuracy.
CleanBook targets the scenario of "long-term browser bookmark maintenance": start with cleaning, deduplication and normalization, then organize links into stable, readable, and sustainably evolving classification structures based on rules and models.
I just want to organize my bookmarks
I want to understand how the system works
I want to contribute to development
| Category | Page | Description |
|---|---|---|
| Quick Start | Quick Start | Installation, minimal example, common parameters |
| User Guide | Best Practices | Configuration ideas, directory organization and maintenance tips |
| Architecture | Design Overview / System Architecture | Pipeline, module boundaries and classification strategies |
| Development | Development Guide | Environment setup, testing and extension points |
| Reference | LLM Templates | Prompt structure and optional interface configuration |
| Archive | Technical Report | Historical supplementary materials and extended notes |