OpenAI is quietly building a generative music tool that creates songs from text and audio prompts, according to sources familiar with the project. The move puts the ChatGPT maker on a collision course with established players like Suno and Google in the rapidly expanding AI music space, potentially adding another dimension to OpenAI's multimodal AI ambitions.
OpenAI is making its next big bet on generative AI, and this time it's targeting your playlist. The company behind ChatGPT is developing a music generation tool that could transform how creators add soundtracks to videos and build musical compositions, sources tell The Information.
The tool works by taking text descriptions and audio prompts to generate original music - think "create upbeat jazz guitar for a cooking video" or "add string accompaniment to this existing vocal track." It's the kind of functionality that could make Suno, the current leader in AI music generation, very nervous about its market position.
What's particularly interesting is OpenAI's approach to training data. The company has partnered with students from the prestigious Juilliard School to annotate musical scores, creating what could be one of the most sophisticated training datasets in the AI music space. This mirrors OpenAI's strategy with other modalities - invest heavily in high-quality training data to create superior outputs.
The timing makes sense given OpenAI's broader multimodal push. The company recently launched Sora, its text-to-video generator, and having native music generation would create a powerful content creation suite. Imagine prompting: "Create a 30-second product demo video with upbeat background music" and getting both visuals and audio in one go.
But OpenAI isn't starting from scratch here. The company actually built generative music models years ago, before ChatGPT made it a household name. Those early experiments have been overshadowed by the text and image generation breakthroughs, but they provided valuable groundwork. More recently, OpenAI has been developing sophisticated audio models focused on speech synthesis and recognition.
The competitive landscape is heating up fast. Google has its own music generation experiments, while companies like Suno have built entire businesses around AI-generated music. Suno can already create full songs with lyrics from simple text prompts, and has attracted millions of users creating everything from birthday songs to viral TikTok tracks.
What remains unclear is how OpenAI plans to package this capability. Will it be a standalone product competing directly with Suno? Or will it integrate into ChatGPT and Sora as part of a broader creative toolkit? The integration approach seems more likely given OpenAI's recent strategy of building comprehensive AI assistants rather than point solutions.
The broader implications extend beyond just content creation. As AI gets better at generating music, questions around copyright, artist compensation, and the future of human creativity become more pressing. OpenAI's entry into this space will likely accelerate those conversations, especially given the company's track record of pushing boundaries in AI capabilities.
For now, OpenAI isn't commenting on the development, maintaining its typical secrecy around unreleased products. But if the timeline follows the company's recent pattern with Sora - months of limited testing before a broader rollout - we could see this music tool emerge sometime in 2026.
OpenAI's move into music generation signals the company's broader ambition to dominate every aspect of content creation. While the timeline remains uncertain, the Juilliard partnership suggests this isn't just another experimental project - it's a serious attempt to bring studio-quality AI music to mainstream creators. The real question isn't whether OpenAI can build compelling music AI, but how quickly it can integrate it into its existing ecosystem and whether established players like Suno can maintain their lead in an increasingly crowded field.