Back
Nov 26, 2025
Author
Today we are thrilled to share a big milestone. Besimple AI has raised $3M to build the data layer for AI, starting with audio. We are grateful to be funded by Y Combinator, Surgepoint Capital, Porterfield Ventures, Amino Capital, WELIGHT Capital, Multimodal Ventures, Script Capital, and a number of amazing angel investors.
The idea came from a problem we had at Meta. Training AI models requires massive amounts of data, but getting high quality data is so hard. We spent years building this for the Llama team at Meta and we are bringing our operational expertise to the audio space. Audio is the most natural interface for generative AI, and more data is needed to train the next generation conversational models.
We start with data collection, curating our own proprietary set of diverse conversational data covering a wide range of languages, dialects and accents. We then leverage human expert audio annotators and our own annotation platform to process audio data for Automatic Speech Recognition. With human level transcription and diarization, our data help push the audio model frontier. Today we have over millions of hours of conversational data, and growing.
This is just the start. Thank you to our team, our early customers, and our investors for believing in this mission. We're hiring. If you want to help build the data layer for AI, join us.
Audio data should besimple :)
