Hey everyone!
We just shipped Smallest AI (Waves) integration into OpenCut-AI, and we're excited to share what this unlocks for creators.
What's new
OpenCut-AI now has three voice engines built in:
1. Local (Coqui XTTS) -- Runs entirely on your machine. 12 languages, voice cloning, zero API keys needed. Best for offline editing and privacy-first workflows.
2. Sarvam AI -- Purpose-built for Indian languages. 22 regional languages for transcription, 11 for text-to-speech, with 23+ natural speaker voices. If you're creating content in Hindi, Tamil, Telugu, Bengali, or any Indian language -- this is the best engine for you.
3. Smallest AI (NEW) -- Ultra-fast cloud TTS and STT. ~100ms latency, 80+ voices across 15 languages, and speech-to-text covering 39 languages with speaker diarization and emotion detection. This is our most versatile engine yet.
What you can do with Smallest AI
- Generate voiceovers in English, Hindi, Spanish, Tamil, and 11 more languages with natural-sounding voices
- Transcribe audio/video in 39 languages with automatic subtitle generation
- Control speech speed from 0.5x to 2.0x
- Pick from 80+ voices -- each language has multiple male and female options
- Process long content -- auto-chunking handles videos of any length
How it works
1. Grab a free API key from app.smallest.ai
2. Paste it in Settings > API Keys > Smallest AI
3. Select "Smallest AI" in the Voiceover or Captions panel
4. Generate
That's it. No server setup, no model downloads, no GPU required.
We're open to adding more models
This is the part we're most excited about. OpenCut-AI is built with a pluggable engine architecture. Adding a new voice or transcription provider is straightforward, and we want the community to drive what comes next.
Models we're considering:
- ElevenLabs (premium voice synthesis)
- Deepgram (real-time STT)
- Fish Speech (open-source voice cloning)
- Kokoro TTS (lightweight and fast)
- StyleTTS 2 (human-level quality)
Have a model you'd love to see? Drop it in the comments with:
- What it does
- Why it matters for video editing
- A link to their docs
We'll prioritize based on what the community wants most.
Links
- GitHub: github.com/Ekaanth/OpenCut-AI
- Smallest AI Docs: waves-docs.smallest.ai
- Get a Smallest AI key: app.smallest.ai
We'd love to hear what models and features you want next. Let us know!
Free AI Video Editor OpenCutAI
Hey Product Hunt!
We're launching OpenCut AI - a fully open-source, local-first AI video editor.
We took [OpenCut](https://github.com/ekaanth/openc...), an open-source video editor, and built an entire AI suite on top of it:
- Edit by text - transcribe your video, then edit it like a Google Doc. Delete a sentence, and the video cuts itself.
- Voice cloning - clone any voice from a 6-second sample and generate voiceovers.
- AI image generation - create images from text and drop them on the timeline.
- Filler word removal - one-click removal of "um", "uh", "like", and "you know."
- Natural language commands - tell the editor "remove the intro" or "speed up the middle."
- Smart subtitles - karaoke, pill, and classic styles, generated instantly.
- Audio denoising - clean up background noise automatically.
- Podcast Clip Generator - AI finds the most viral-worthy 30–60 second moments from long podcasts.
- Word-Pop Karaoke Subs - Hormozi-style subtitles where each word pops up when spoken. 4 preset styles.
- Multi-speaker detection - auto-detect who's talking with pyannote AI. Video auto-cuts at speaker boundaries.
- Auto-Reframe 9:16 - face-tracking crop for TikTok, Reels, and Shorts.
- Brand Kit - define your brand once and apply intro/outro cards, lower thirds, watermarks in one click.
- Emotion detection - SpeechBrain AI detects emotional peaks to find the most impactful moments.
Everything runs locally. No cloud. No API keys. No subscriptions. No per-minute billing. Your videos never leave your machine.
Under the hood, we're standing on the shoulders of incredible open-source models like Whisper for transcription, XTTS v2 for voice cloning, Stable Diffusion for image generation, Llama 3.2 via Ollama for natural language commands, Pyannote for speaker diarization, and MediaPipe for face detection. All of it packaged in a single `docker compose up -d`.
We built this because video editing tools are either expensive, cloud-dependent, or closed-source and we wanted one that was none of those things.
Self-host it for as low as $20/mo, or just run it on your laptop.
Would love your feedback — what AI features would you want in a video editor?
Free AI Video Editor OpenCutAI
We just shipped multilingual Indian language support [see the thread](https://x.com/humblefool/status/2036125305908371528). OpenCut AI now supports 30+ languages including 22 Indian regional languages (Hindi, Tamil, Telugu, Kannada, Bengali, Malayalam, Marathi, Gujarati, Punjabi, Odia, and more) powered by [Sarvam AI](https://sarvam.ai). Transcribe, translate, and generate voiceovers with 37+ Indian voice speakers — all from within the editor. Add your Sarvam API key in Settings and you're good to go.
Would love your feedback — what AI features would you want in a video editor?