finds.dev← search

// the find

michaelliao/shici.puppylab.org

★ 516 · JavaScript · Apache-2.0 · updated Jan 2026

中华诗词网站 www.shi-ci.com 诗词文件及源码

A Chinese classical poetry archive covering 60,000+ poems from pre-Qin through modern times, stored as plain text files organized by dynasty and poet. The backend is a Spring Boot app with full-text search and a REST API. This is primarily a data repository for Chinese classical literature, not a general-purpose poetry platform.

The flat-file storage approach (one poem per .txt file, directory hierarchy as taxonomy) makes the corpus easy to diff, grep, and contribute to via pull requests — a genuinely good call for a content-heavy project. The data itself is the asset, and it's clearly the focus; 60k poems with consistent formatting is non-trivial curation. The Spring Boot app is straightforward, no clever abstractions hiding the data model. Ansible deployment config included, so you can actually run it somewhere without reverse-engineering the infrastructure.

Travis CI badge pointing at a dead CI provider (travis-ci.org shut down for open source in 2021) means the build status is meaningless and there's no current CI. The repo ships compiled artifacts (a 1.0 JAR in /release) checked into git, which is a bad habit — that JAR will drift from the source. The full-text search implementation in SearchService.java appears to be basic string matching against loaded text files rather than anything indexed, which will fall over at scale. No API documentation anywhere; you have to read the controller source to figure out what endpoints exist and what they return.

View on GitHub →

// want more like this?

We dig through GitHub every week and send a few repos picked for what you actually care about — each with an honest take like this one.

Get finds in your inbox → Search again →