
Community Case Study

Loggi transforms data for more efficient and reliable deliveries

30%

faster ETL pipeline to optimize route mapping

89%

fewer Looker dashboards to mitigate data sprawl

7,000

unneeded Redshift tables eliminated

Loggi is one of Brazil’s largest logistics companies, delivering over 300,000 packages per day across all 27 states—including hard-to-reach regions like Manaus in the Amazon. With more than 2,000 employees producing and consuming data across a sprawling operation, the company faced mounting challenges in data discoverability, trust, and reporting accuracy. OpenMetadata provided the open-source foundation Loggi needed to organize ownership, surface data lineage, and clean up their environment. Loggi streamlined their data infrastructure by reducing Looker dashboards from 18,000 to 2,000 and eliminating 7,000 Redshift tables, which significantly lowered the human costs associated with discovery, maintenance, and the analysis of outdated information. Today, OpenMetadata powers a more transparent and governable data practice at Loggi—one built for scale, community, and long-term clarity.
Industry

Logistics / Supply Chain

Technologies

Redshift, dbt, Looker, OpenMetadata

Quotes
"OpenMetadata’s data quality feature helps us proactively monitor key data sources, enabling faster incident response and improving data reliability—critical for maintaining efficient operations and accurate delivery schedules."
Erica Bertan
Analytics Engineering Manager at Loggi
Challenges: Dashboard sprawl leads to inconsistent data, impacting operational efficiency
With nearly 100 ETL jobs running each day and daily ingestion reaching 200GB of data and 9 million package tracking records, coordinating data governance across Loggi’s 100TB Redshift warehouse quickly became unscalable. Their most critical ETL pipeline, the “midnight job,” ran every night for eight hours, updating nearly 500 tables to power next-day operations. As new data assets and models flooded the system, discoverability and trust began to erode, leading to duplicated work, inefficiencies, and delayed decision-making. This directly impacted their ability to optimize delivery routes and schedules, potentially causing delays in the logistics chain.
Undefined ownership
With nearly 500 dbt models and 10,000 warehouse tables, identifying who could explain the business logic behind a dataset often required multiple team members, slowing down problem resolution and decision-making.
Inconsistent reporting
Looker contained 18,000 dashboards with overlapping or conflicting metrics, making it hard for teams to trust and align on a single source of truth, causing delays in route planning and logistics decisions.
Low discoverability
Analysts struggled to find the correct tables to use, leading to duplicated work, inefficiencies, and repeated questions across teams—slowing down the logistics optimization process.
Technical complexity
Similar-sounding tables like package_events and package_register created confusion even among experienced users, slowing time to insight and hindering the optimization of delivery routes.
Lack of observability
Without proactive alerts or data quality checks, errors often went unnoticed until downstream teams were already impacted, potentially affecting delivery timelines and operational reliability.
Solutions: A shared, open source layer for smarter data governance
Loggi turned to OpenMetadata to help bring structure, ownership, and observability to their complex, fast-growing ecosystem. As a centralized metadata management layer, OpenMetadata gave Loggi the flexibility to tailor the platform to their unique data landscape, allowing their internal teams to move quickly without being locked into a rigid commercial solution. From cataloging and lineage to proactive data quality checks and alerts, OpenMetadata became the connective tissue powering better collaboration, observability, and decision-making across the company.
Ownership-driven modeling
Assigned clear owners to their dbt models, making it easier to triage incidents and understand business logic behind datasets, which improved the speed of operational decisions and reduced routing delays.
End-to-end lineage tracking
Visualized downstream dependencies and retired outdated models safely, reducing risk, improving the accuracy of logistics data, and letting teams evolve their models with confidence.
Streamlined dashboard governance
Identified outdated dashboards in Looker and facilitated a company-wide cleanup effort to remove redundant reports. This cleanup reduced reporting inefficiencies, making it easier for teams to access relevant metrics that supported timely logistics decisions.
Metadata-driven cataloging
Cataloged hundreds of tables with multilingual (Portuguese and English) column and table descriptions, helping teams discover and understand assets faster, allowing for quicker adaptation to changing logistics needs.
Proactive data quality modeling
Enabled tests on critical models and triggered alerts when data behaved unexpectedly, boosting observability and trust in data, which directly improved logistics planning and delivery accuracy.
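To make the idea of a proactive check concrete, here is a minimal sketch of a row-count test that raises an alert when a table shrinks unexpectedly. It is written in plain Python and is not OpenMetadata's API or Loggi's actual test suite. The function names, the 50% threshold, and the row counts are all assumptions chosen for illustration; only the table names come from the case study.

```python
from dataclasses import dataclass


@dataclass
class CheckResult:
    table: str
    passed: bool
    message: str


def check_row_count(table, today_rows, yesterday_rows, max_drop=0.5):
    """Fail when today's row count drops by more than max_drop versus yesterday.

    Hypothetical helper: the threshold and the inputs are illustrative only.
    """
    if yesterday_rows == 0:
        # No baseline to compare against; pass only if some rows arrived.
        return CheckResult(table, today_rows > 0, "no baseline rows to compare against")
    change = (today_rows - yesterday_rows) / yesterday_rows
    passed = change >= -max_drop
    return CheckResult(table, passed, f"row count changed by {change:+.0%}")


def failing_checks(results):
    """Return only the checks that should trigger an alert."""
    return [r for r in results if not r.passed]


# Made-up row counts for two tables named in the case study.
results = [
    check_row_count("package_events", 9_000_000, 8_800_000),
    check_row_count("package_register", 1_000, 250_000),
]
alerts = failing_checks(results)
for alert in alerts:
    print(f"ALERT {alert.table}: {alert.message}")
```

In practice a check like this would run after each load, so a failure reaches the owning team before downstream users see bad data.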
Results: Faster pipelines and leaner infrastructure lead to optimized logistics
With OpenMetadata, Loggi turned a scattered, hard-to-navigate data environment into a governed, trusted, and streamlined foundation for decision-making. By surfacing ownership, enabling data quality checks, and cleaning up outdated dashboards and warehouse assets, the team now spends less time chasing down issues—and more time improving performance. What began as an open-source metadata platform quickly became a critical enabler of scale, savings, and trust across the organization.
Faster ETL performance
Loggi’s most critical job—the nightly pipeline that updates nearly 500 tables—now runs 30% faster, enabling more timely updates to delivery schedules and route optimization.
Reduced infrastructure costs
After deleting 7,000 unused Redshift tables, the company saved $2,000 a month in infrastructure costs and freed up resources to better align with business priorities.
Lighter analytics layer
Reduced Looker dashboards from 18,000 to just 2,000 by deprecating unused or duplicative reports—eliminating unnecessary overhead and making critical metrics more accessible to decision-makers.
Stronger trust in data
Monitored key models with automated data quality checks and alerts, ensuring that logistics operations are always running on accurate and reliable data.
More scalable governance
Established a centralized, metadata-driven foundation for ownership, cataloging, and observability—without vendor lock-in.
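The headline "89% fewer Looker dashboards" figure follows from the before and after counts reported above. A short sketch of the arithmetic, where the helper name is only illustrative:

```python
def percent_reduction(before, after):
    """Percentage by which a count shrank from `before` to `after`."""
    return (before - after) / before * 100


# Dashboard counts as reported in the case study: 18,000 before, 2,000 after.
dashboard_reduction = percent_reduction(18_000, 2_000)
print(f"{dashboard_reduction:.0f}% fewer dashboards")
```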