
Community Case Study

Thndr Scales Governance Across Millions of Investor Accounts with OpenMetadata

3M+

user accounts protected with automated PII detection and classification

6

data team members seamlessly managing enterprise-scale governance

Thndr is an award-winning investment platform with over 3 million users. The app serves as the entry point for 82% of all new registered investors in Egypt, adding more than 190,000 new investors to the market. In 2023, Thndr was named one of Forbes’ top 30 fintech companies, awarded Most Innovative Brokerage Firm by the Egyptian Exchange, and won the Entrepreneur's Award. This rapid growth put increasing pressure on its small data team to maintain data quality, security, and discoverability across expanding pipelines and dashboards. Manual checks for data freshness, schema changes, and PII detection were slowing the team down. OpenMetadata provided a collaborative, open source platform for unified metadata management, automated PII classification, enhanced data lineage, and robust data quality testing. The result: streamlined governance, faster discovery, and stronger trust in the data that powers investment decisions.
Industry

Financial Services

Technologies

AWS (EC2, EKS), Docker Swarm, Apache Airflow, Elasticsearch, MySQL, Slack, OpenMetadata

Quotes
"We chose OpenMetadata because it’s open source and you can easily deploy it. It’s a single solution for all your data cataloging, data governance, and data quality needs. And the community support is instant. If you reach out to them on Slack, they instantly solve your problem."
Fizza Abid
Data Platform Engineer at Thndr
Manual checks slowed a lean, already-strapped data team
Thndr’s six-person data team—two engineers, one data platform engineer, and three analysts—supports a fast-scaling investment platform that powers millions of investor accounts and processes critical financial data. As adoption surged, so did the volume and variety of data flowing through warehouses, pipelines, and dashboards. But without a centralized metadata system, the team had no easy way to trace end-to-end data lineage. Batch ETL workflows, central to Thndr’s operations, required timely, duplicate-free updates, yet quality checks for these jobs had to be performed manually. Essential governance tasks were slow, fragmented, and prone to error. The team relied on manual processes to monitor freshness, detect schema changes, identify PII, and document business terms—tasks that became increasingly unsustainable at scale. Thndr needed an open source platform that could unify metadata management, automate quality checks, and make data assets easier to find, trust, and govern across the organization.
No automated data quality alerts
Freshness, volume, and schema changes had to be checked manually, slowing batch ETL monitoring and resolution.
Poor lineage
No single view of where data originated, how it was transformed, or its downstream usage.
Inefficient PII detection
Regex-based scanning often missed sensitive values not reflected in column names.
No centralized business or technical glossary
Definitions and context were scattered across tools, making discovery and understanding time-consuming.
Siloed metadata across systems
Warehouse tables, pipeline assets, and dashboards had to be checked separately.
Limited collaboration workflows
Assigning ownership, managing incidents, and handing work off between data engineers and analysts all had to be done by hand.
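To make the PII gap above concrete, here is a minimal sketch of why scanning column names alone can miss sensitive data. The case study does not describe Thndr's original scanner, so the patterns, column names, and sample rows are all hypothetical. The value-level check is a simple regex stand-in, not the machine learning classification that OpenMetadata provides.

```python
import re

# Hypothetical column-name pattern; not taken from Thndr's actual scanner.
NAME_PATTERN = re.compile(r"(email|phone|ssn|national_id)", re.IGNORECASE)

# Illustrative value-level pattern for email addresses only.
EMAIL_VALUE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def flag_by_name(columns):
    """Flag columns whose *name* looks sensitive."""
    return [c for c in columns if NAME_PATTERN.search(c)]


def flag_by_values(table, threshold=0.8):
    """Flag columns where most sampled *values* look like email addresses."""
    flagged = []
    for col, values in table.items():
        non_empty = [v for v in values if v]
        if not non_empty:
            continue
        hits = sum(1 for v in non_empty if EMAIL_VALUE.match(v))
        if hits / len(non_empty) >= threshold:
            flagged.append(col)
    return flagged


# Hypothetical sample: "contact_info" holds emails but has a neutral name.
sample = {
    "user_email": ["a@example.com", "b@example.com"],
    "contact_info": ["c@example.com", "d@example.com"],
    "notes": ["called on Monday", "prefers SMS"],
}

print(flag_by_name(sample))    # ['user_email']
print(flag_by_values(sample))  # ['user_email', 'contact_info']
```

The name-based pass never sees "contact_info", while sampling the values catches it. A production classifier would need to cover many more entity types than this single pattern.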
Open source metadata governance powers faster discovery
For Thndr, the appeal of OpenMetadata went beyond features; it was about control, flexibility, and community. With OpenMetadata, the small data team at Thndr was able to build a centralized platform for metadata that works the way their team works. The team self-hosted the platform on AWS EC2, orchestrated with Docker Swarm, and connected it to Apache Airflow for workflow automation, Elasticsearch for fast search, and MySQL for storing internal metadata. Data from warehouses, pipelines, and dashboards is ingested directly into OpenMetadata, creating a single source of truth. Alerts flow directly into Slack, keeping incident response in the same channels where the team already collaborates. For production, Thndr rotated JWT tokens and implemented SSO for secure, streamlined access management.
Unified data discovery
Search and filter columns, tables, stored procedures, and views in seconds, eliminating the need to check multiple systems separately.
Centralized data cataloging
Consolidated all metadata—warehouse tables, pipelines, and dashboards—into one accessible platform.
Automated data classification
Used machine learning to identify sensitive PII values even when column names don’t match standard patterns.
End-to-end data lineage
Automatically mapped column- and table-level lineage to show data origins, transformations, and downstream usage to support transparency and troubleshooting.
Integrated quality checks
Ran custom SQL-based tests for freshness, duplication, and schema changes, with alerts routed instantly to Slack.
Glossary for shared context
Built a business and technical glossary to standardize definitions and improve data literacy across the organization.
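The SQL-based freshness and duplication tests mentioned above could look roughly like the following sketch. It uses Python's built-in sqlite3 module against an in-memory table so it runs anywhere. The table name, column names, and thresholds are assumptions for illustration. It does not use OpenMetadata's test framework or reflect Thndr's actual queries.

```python
import sqlite3
from datetime import datetime, timedelta


def count_duplicates(conn, table, key):
    """Return how many distinct key values appear more than once.

    `table` and `key` are interpolated directly, so they must be
    trusted identifiers in this sketch.
    """
    sql = (
        f"SELECT COUNT(*) FROM ("
        f"SELECT {key} FROM {table} GROUP BY {key} HAVING COUNT(*) > 1)"
    )
    return conn.execute(sql).fetchone()[0]


def is_fresh(conn, table, ts_column, now, max_age):
    """Return True if the newest row is no older than `max_age`."""
    latest = conn.execute(f"SELECT MAX({ts_column}) FROM {table}").fetchone()[0]
    if latest is None:  # an empty table counts as stale
        return False
    return now - datetime.fromisoformat(latest) <= max_age


# Hypothetical batch-loaded table with one duplicated trade_id.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE trades (trade_id INTEGER, loaded_at TEXT)")
conn.executemany(
    "INSERT INTO trades VALUES (?, ?)",
    [
        (1, "2024-01-01T00:00:00"),
        (2, "2024-01-01T06:00:00"),
        (2, "2024-01-01T06:00:00"),
    ],
)

now = datetime(2024, 1, 1, 12, 0, 0)
print(count_duplicates(conn, "trades", "trade_id"))                    # 1
print(is_fresh(conn, "trades", "loaded_at", now, timedelta(hours=24)))  # True
print(is_fresh(conn, "trades", "loaded_at", now, timedelta(hours=1)))   # False
```

In a setup like the one described, a failing result from either check would be what triggers the Slack alert.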
Faster, more trusted data to strengthen investor confidence
In a fintech environment where data accuracy, security, and speed directly influence customer trust, OpenMetadata has become an operational backbone for Thndr’s data team. By deploying on their own infrastructure, Thndr ensured full control over performance, security, and customization, while the open source model meant they could quickly iterate on new needs with guidance from a responsive global community.
Today, OpenMetadata underpins everything from tracing data lineage for regulatory transparency to catching stale or duplicate data in critical batch ETL workflows. What once took hours of manual checking now happens automatically, freeing the team to focus on higher-value analysis and product innovation.
With OpenMetadata, Thndr has transformed governance from a reactive safeguard into a strategic advantage. Automated PII detection, real-time quality alerts, and clear lineage give their lean team complete visibility into data flows, ensuring compliance while enabling faster, more confident decision-making. Every dashboard, pipeline, and warehouse table is now part of a connected ecosystem that the team can monitor and improve continuously. The result is a data foundation that not only meets the demands of regulatory oversight but also fuels the trusted insights behind millions of investment decisions every day, building lasting investor confidence at scale.
Higher data trust
Automated quality checks reduce manual error risk, ensuring more reliable insights for investment decision-making. Built-in PII classification detects sensitive values beyond column names, reducing compliance risk and strengthening data security.
Granular access control
Attribute- and role-based permissions ensure the right people have the right level of access to sensitive data assets.
Faster operational response
Slack-based alerts allow the team to respond to data incidents in real time, minimizing downtime and service disruption.
Greater team productivity
Freed from repetitive checks, the six-person data team can focus on analytics, feature delivery, and improving the investor experience.
Improved cross-team collaboration
Data engineers, analysts, and interns use the same platform to assign tasks, resolve incidents, and access a shared glossary and centralized catalog—eliminating ambiguity and speeding up data discovery across teams.
Scalable governance
Maintains governance standards as the platform grows past 3M users and onboards 82% of new registered investors in Egypt. Automated lineage mapping gives internal teams instant visibility into data origins, transformations, and usage.