Aflorithmic, a London/Barcelona-based technology company, is pushing the boundaries of Audio-As-A-Service. They provide a platform that enables fully automated, scalable audio production by using synthetic media, voice cloning, and audio mastering, all of which can be delivered on any device, such as websites, mobile apps, or smart speakers.
With this, Aflorithmic affirms, anybody can create beautiful audio, ranging from a simple text to music and complex audio engineering. All this is possible without any previous experience in audio engineering.
For Dr. Timo Kunz, Co-Founder and CEO of Aflorithmic, the potential of synthetic audio is similar to digital photography: “In 2018, an estimated 1 trillion photos were taken. That’s over 2,7 billion a day! It is estimated that ten percent of all photos ever taken were taken in the last twelve months. We expect a similar explosion in the production of synthetic audio.”
For Kunz, it’s important to explain what synthetic audio actually is. “Synthetic audio uses algorithms to create and manipulate sound. This can be music, speech, some other sounds, or all of these mixed together. Most people will have experienced some product using text-to-speech (TTS) – that is text that is “translated” to speech – you might know it from a GPS or Siri or might have heard it on TikTok. However the latest TTS models are often indistinguishable from human speakers”.
'We are working on a project with our strategic investor Crowd Media. A dedicated team is working on Social Commerce, which is a conversational AI experience with avatars. Think Kim Kardashian having a one-on-one video conversation with each of her followers. A first version of that product is scheduled for this year. When it comes to Metaverse in the sense of a virtual world we can certainly see the audio part running on api.audio. However, we’re not actively developing anything specifically for that purpose at the moment."
Full story here