Coming May 2025
Get Early Access
aVenture is in Alpha: aVenture recently launched early public access to our research product. It's intended to illustrate capabilities and gather feedback from users. While in Alpha, you should expect the research data to be limited and may not yet meet our exacting standards. We've made the decision to temporarily present this information to showcase the product's potential, but you should not yet rely upon it for your investment decisions.
aVenture is in Alpha: aVenture recently launched early public access to our research product. It's intended to illustrate capabilities and gather feedback from users. While in Alpha, you should expect the research data to be limited and may not yet meet our exacting standards. We've made the decision to temporarily present this information to showcase the product's potential, but you should not yet rely upon it for your investment decisions.
© aVenture Investment Company, 2025. All rights reserved.
44 Tehama St, San Francisco, CA 94105
aVenture Investment Company ("aVenture") is an independent venture capital research platform providing detailed analysis and data on startups, venture capital investments, and key industry individuals.
While we strive to provide valuable insights with objectivity and professional diligence, we cannot guarantee the accuracy of the information provided on our platform. Before making any investment decisions, you should verify the accuracy of all pertinent details for your decision.
aVenture does not offer investment advisory services and is not registered as an investment adviser. The data provided by aVenture does not constitute recommendations or advice, whether by methodology or a statement written by a staff member of aVenture.
Links to external websites do not imply endorsement or affiliation with aVenture. References or links to providers offering the ability to invest in a primary or secondary transaction in a company are for convenience purposes only. They are not solicitations or offers to buy or sell an investment. Remember that past performance does not guarantee future results, and venture capital and private assets should be a contributory part of a diversified portfolio.
From TechCrunch
By David Riggs
May 15, 2024
Chang She, previously the VP of engineering at Tubi and a Cloudera veteran, has years of experience building data tooling and infrastructure. But when She began working in the AI space, he quickly ran into problems with traditional data infrastructure — problems that prevented him from bringing AI models into production.
“Machine learning engineers and AI researchers are often stuck with a subpar development experience,” She told TechCrunch in an interview. “Data infra companies don’t really understand the problem for machine learning data at a fundamental level.”
So Chang — who’s one of the co-creators of Pandas, the wildly popular Python data science library — teamed up with software engineer Lei Xu to co-launch LanceDB.
LanceDB is building the eponymous open source database software LanceDB, which is designed to support multimodal AI models — models that train on and generate images, videos and more in addition to text. Backed by Y Combinator, LanceDB this month raised $8 million in a seed funding round led by CRV, Essence VC and Swift Ventures, bringing its total raised to $11 million.
“If multimodal AI is critical to the future success of your company, you want your very expensive AI team to focus on the model and bridging the AI with business value,” Chang said. “Unfortunately, today, AI teams are spending most of their time dealing with low-level data infrastructure details. LanceDB provides the foundation AI teams need so they can be free to focus on what really matters for enterprise value and bring AI products to market much faster than otherwise possible.”
LanceDB is essentially a vector database — a database containing series of numbers (“vectors”) that encode the meaning of unstructured data (e.g. images, text and so on).
As my colleague Paul Sawers recently wrote, vector databases are having a moment as the AI hype cycle peaks. That’s because they’re useful for all manner of AI applications, from content recommendations in ecommerce and social media platforms to reducing hallucinations.
The vector database competition is fierce — see Qdrant, Vespa, Weaviate, Pinecone and Chroma to name a few vendors (not counting the Big Tech incumbents). So what makes LanceDB unique? Better flexibility, performance and scalability, according to Chang.
For one, Chang says, LanceDB — which is built on top of Apache Arrow — is powered by a custom data format, Lance Format, that’s optimized for multimodal AI training and analytics. Lance Format enables LanceDB to handle up to billions of vectors and petabytes of text, images and videos, and to allow engineers to manage various forms of metadata associated with that data.
“Until now, there’s never been a system that can unite training, exploration, search and large-scale data processing,” Chang said. “Lance Format allows AI researchers and engineers to have a single source of truth and get lightning-fast performance across their entire AI pipeline. It’s not just about storing vectors.”
LanceDB makes money by selling fully managed versions of its open source software with added features such as hardware acceleration and governance controls — and business appears to be going strong. The company’s customer list includes text-to-image platform Midjourney, chatbot unicorn Character.ai, autonomous car startup WeRide and Airtable.
Chang insisted that LanceDB’s recent VC backing wouldn’t shift its attention away from the open source project, though, which he says is now seeing around 600,000 downloads per month.
“We wanted to create something that would make it 10x easier for AI teams working with large-scale multimodal data,” he said. “LanceDB offers — and will continue to offer — a very rich set of ecosystem integrations to minimize adoption effort.”
Will Musk vs. Trump affect xAI’s $5 billion debt deal?
While the online feud between Elon Musk and President Donald Trump seemed to drive traffic to Musk’s social media platform X (formerly Twitter), it could also create issues for the platform’s parent company xAI.
Jun 7, 2025
Superblocks CEO: How to find a unicorn idea by studying AI system prompts
Brad Menezes, CEO of enterprise vibe coding startup Superblocks, is convinced that the next crop of billion-dollar startup ideas are hiding in almost plain sight: system prompts.
Jun 7, 2025
Meet the Finalists: VivaTech’s 5 Most Visionary Startups of 2025
Narrowing down the 30 most visionary startups of the year to just five finalists was no easy feat. VivaTech’s Innovation of the Year attracted an extraordinary pool of applicants—startups tackling massive global challenges with bold, technically sophisticated, and scalable solutions. From redefining human-machine interaction to revolutionizing healthcare, climate, and infrastructure, each company brought something unique […]
Jun 6, 2025