// the find
opendatadiscovery/awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
A reference list of data catalog and metadata management tools, covering OSS options (DataHub, OpenMetadata, Amundsen, Atlas) alongside cloud-native and proprietary products. The main value is the feature comparison matrices — GenAI readiness, semantic translation, data quality, lineage — laid out in a single document. Aimed at data engineers and architects evaluating tooling for metadata management.
The three-tier OSS/Cloud/Proprietary split is genuinely useful for narrowing scope before doing a real evaluation. The GenAI readiness matrix (MCP support, vector store, semantic search) is timely and not something you find compiled elsewhere. Coverage is broad — 35+ products with consistent columns — which makes cross-product comparison tractable. Actively maintained as of mid-2025, so the MCP and GenAI columns reflect the current landscape.
This is a list, not an evaluation — every product gets the same neutral treatment, so you still have to do the actual research yourself. The feature matrix cells are vendor-reported or community-guessed; there's no methodology documented, no version pinning, and some cells have '?' which undermines confidence in the rest. No guidance on which tools are production-ready vs. abandoned or barely-maintained (Grai Core, Magda, Hamilton are not in the same league as DataHub). The repo's own OSS origin (opendatadiscovery) means ODD platform is listed first in every table, which is a mild but real conflict of interest.