Aug 19, 2025
                    
                        By Stephanie Simone
                    
lakeFS, a leading “git-for-data” version control system for enterprise data and AI initiatives, announced it has raised $20 million in a growth funding round, enabling the company to accelerate its expansion supporting data engineering, AI, and ML projects in the enterprise and public sector markets.
This investment supports the rapidly growing demand for lakeFS, fueling the continued expansion of its engineering and go-to-market teams, accelerating product development, and deepening global enterprise partnerships, according to lakeFS.
The funding round brings the company’s total raised capital to $43 million and was led by Maor Investments along with existing investors Dell Technologies Capital, Norwest, and Zeev Ventures.
According to the company, thousands of organizations including Arm, Bosch, Lockheed Martin, NASA, Volvo, and the U.S. Department of Energy are already using lakeFS as part of their data management infrastructures.
“We’re still at the very beginning of the AI revolution and organizations struggle to unlock value and business efficiencies using AI,” said Dr. Einat Orr, co-founder and CEO of lakeFS. “Enterprises are adopting lakeFS as an infrastructure layer in their data and AI operations to reduce time-to-market on their AI initiatives while increasing data and model quality. This is more important than ever because the organizations that innovate fastest will be the ones that win. This funding will allow us to double down on innovation—particularly around features critical to enterprise-scale AI operations.”
Just as “Git revolutionized software development, lakeFS is reshaping enterprise AI by versioning the data that powers it,” the vendor said.
Designed for massive volumes of unstructured, semi-structured, and structured data in data lakes—text, images, audio, video—lakeFS gives organizations control, safety, and reproducibility at scale.
Using lakeFS, enterprise data, AI, and ML teams can:
- Efficiently experiment and iterate on massive datasets without duplicating storage.
- Reproduce AI/ML models and training pipelines for compliance, auditing, and traceability.
- Collaborate at scale with full control over changes to data, models, and environments.
The role of lakeFS is frequently cited by experts from leading organizations at global technology conferences as a fundamental part of their data and AI infrastructure. As data volumes scale rapidly, lakeFS provides the versioning, reproducibility, and control needed to manage complex pipelines with confidence. It empowers teams to experiment safely and collaborate more efficiently, making it an essential component in delivering reliable and scalable data, AI and ML operations, according to the company.
For more information about this news, visit https://lakefs.io.