finds.dev← search

// the find

opendatadiscovery/awesome-data-catalogs

★ 1,041 · MIT · updated Aug 2025

📙 Awesome Data Catalogs and Observability Platforms.

A reference list of data catalog and metadata management tools, covering OSS options (DataHub, OpenMetadata, Amundsen, Atlas) alongside cloud-native and proprietary products. The main value is the feature comparison matrices — GenAI readiness, semantic translation, data quality, lineage — laid out in a single document. Aimed at data engineers and architects evaluating tooling for metadata management.

The three-tier OSS/Cloud/Proprietary split is genuinely useful for narrowing scope before doing a real evaluation. The GenAI readiness matrix (MCP support, vector store, semantic search) is timely and not something you find compiled elsewhere. Coverage is broad — 35+ products with consistent columns — which makes cross-product comparison tractable. Actively maintained as of mid-2025, so the MCP and GenAI columns reflect the current landscape.

This is a list, not an evaluation — every product gets the same neutral treatment, so you still have to do the actual research yourself. The feature matrix cells are vendor-reported or community-guessed; there's no methodology documented, no version pinning, and some cells have '?' which undermines confidence in the rest. No guidance on which tools are production-ready vs. abandoned or barely-maintained (Grai Core, Magda, Hamilton are not in the same league as DataHub). The repo's own OSS origin (opendatadiscovery) means ODD platform is listed first in every table, which is a mild but real conflict of interest.

View on GitHub →

// want more like this?

We dig through GitHub every week and send a few repos picked for what you actually care about — each with an honest take like this one.

Get finds in your inbox → Search again →