ElevenLabs CEO Mati Staniszewski on the AI Voice Revolution



The global leader in voice generation AI, ElevenLabs, is coming to Japan for the first time. Its co-founder and CEO, Mati Staniszewski, will finally visit Japan on November 18, 2025.
This event offers a valuable opportunity to hear directly from Mati, who is at the forefront of AI voice technology, about how "voice AI" is transforming both creative production and enterprise DX in Japan.
AICU media recognizes that as generative AI advances in image, video, and animation, voice is the last major creative domain remaining, and is rapidly gaining attention as a central technology essential for AI film and AI video production.
■ "Voice AI Revolution" - Mati discusses the latest strategies and commitment to the Japanese market
Mati's keynote speech will discuss ElevenLabs' next-generation strategy from the following perspectives:
- Evolution of emotionally rich narration generation technology
- The future of AI agents capable of real-time voice interaction
- Acceleration of content creation and creation of new experience value
- Specific prospects and commitment in the Japanese market
ElevenLabs is rapidly growing as a voice AI platform that crosses all fields such as film, animation, games, education, and customer support.
It can be said that the era in which AI "speaks" and "expresses emotions" is finally about to begin in earnest.
■ Top creators and companies from Japan will also be on stage
Following the keynote speech, advanced usage examples in Japan will be introduced.
Hiroki Yamaguchi (Film Director / Gauma Pix Inc.)
Explaining behind the scenes of the generative AI movie "Grandma Revito." The new era of film production workflow using AI video x AI voice will be discussed.
Takahiro Abe (SI&C Co., Ltd.)
The outlook for voice-based AI agents in the manufacturing and finance industries. The value of "voice" in corporate business transformation will be demonstrated.
The possibilities of voice AI in both creative and business aspects will be disseminated from Japan to the world.
■ Event Details
Date: November 18, 2025 (Tuesday) 13:30 Registration begins / 14:00 Start - 16:30 End
Venue: Shin Marunouchi Building 10F EGG Event Space 1-5-1 Marunouchi, Chiyoda-ku, Tokyo 100-6510
Participation Fee: Free (lottery if there are many applicants)
Participation Registration: Prior approval required (host approval required) Click here to request participation
■ Agenda
- Mati Staniszewski (ElevenLabs CEO) "The Future of Voice AI and Japan"
- Hiroki Yamaguchi (Film Director / Representative of Gauma Pix Inc.) "The Production Process of the Generative AI Movie 'Grandma Revito'"
- Takahiro Abe (SI&C Consulting Unit / Associate Principal) "The Outlook for Voice-Based AI Agents in Business Transformation"
■ From the AICU media Editorial Department
Here is a video about ElevenLabs' product updates produced in August 2025.
Here is a video about ElevenLabs' product updates produced by AICU in August 2025.
This video explains in detail ElevenLabs, which is attracting attention as an AI audio platform, from its overview to the latest technology. While touching on support from well-known VCs and adoption records at companies such as Google and Zoom, we will introduce what kind of company it is, and in the video, we will delve into the various core technologies provided by ElevenLabs.
Main technologies: "Text-to-Speech (TTS)" that generates speech from text, "Speech-to-Speech (STS)" that converts speech to another voice while maintaining the original voice quality, "Voice Cloning" that duplicates a specific voice, "Sound Effects (SFX)" that generates sound effects from text prompts, and so on.
Product applications: "Studio" function for audiobooks and podcast production, AI dubbing function that supports 29 languages and maintains emotions and tones, and other specific usage examples are also introduced.
Latest model: The "Scribe" model, which realizes highly accurate transcription, will be explained in comparison with Whisper.
Voice Cloning: The differences between "Instant Voice Cloning," which can be created immediately with 30 seconds of audio, and "Professional Voice Cloning," which involves voiceprint authentication, and its safety are also explained.
Safety and Ethics: Measures against deepfakes and participation in Content Provenance and Authenticity (C2PA) to ensure safe use are also introduced.
Of particular note is the demonstration of the latest speech synthesis model "V3," which is demonstrated in the second half of the video.
Pay attention to how V3 enables natural news reading compared to conventional TTS voice. Furthermore, please listen to the amazing evolution of rich "emotional expression" (lines like a dragon in a cave), intonation in dialects (Hakata dialect), and "non-verbal expressions" such as laughter.
"Voice" will become the next creative foundation
Movies, animation, VTubers, games, advertising── Amidst the advancement of AI-based generation, the last remaining area of "humanity" is voice.
- Voice that puts "soul" into video
- Narration that guides the story
- Conversation that gives life to characters
- AI agent that transforms the customer experience
ElevenLabs is becoming a "voice OS" that integrates all of these. Please don't miss this opportunity.
Sessions with interpreters are also planned, so you can participate with peace of mind in both Japanese and English!
The Chroma Awards, sponsored by ElevenLabs, are also approaching the deadline!
The Chroma Awards A Film, Music, and Games competition to unite creators, commu chromaawards.com
