// the find
michaelliao/shici.puppylab.org
中华诗词网站 www.shi-ci.com 诗词文件及源码
A Chinese classical poetry archive covering 60,000+ poems from pre-Qin through modern times, stored as plain text files organized by dynasty and poet. The backend is a Spring Boot app with full-text search and a REST API. This is primarily a data repository for Chinese classical literature, not a general-purpose poetry platform.
The flat-file storage approach (one poem per .txt file, directory hierarchy as taxonomy) makes the corpus easy to diff, grep, and contribute to via pull requests — a genuinely good call for a content-heavy project. The data itself is the asset, and it's clearly the focus; 60k poems with consistent formatting is non-trivial curation. The Spring Boot app is straightforward, no clever abstractions hiding the data model. Ansible deployment config included, so you can actually run it somewhere without reverse-engineering the infrastructure.
Travis CI badge pointing at a dead CI provider (travis-ci.org shut down for open source in 2021) means the build status is meaningless and there's no current CI. The repo ships compiled artifacts (a 1.0 JAR in /release) checked into git, which is a bad habit — that JAR will drift from the source. The full-text search implementation in SearchService.java appears to be basic string matching against loaded text files rather than anything indexed, which will fall over at scale. No API documentation anywhere; you have to read the controller source to figure out what endpoints exist and what they return.