OpenStudio

Your voice. Your frequency. No permission required.

Try the Live Demo · Report Bug · Sponsor

Somewhere right now, a community radio host is calculating whether they can afford another month of their streaming platform. A podcast collective just lost their entire archive because a service shut down. An independent voice got silenced — not by censorship, but by a credit card expiration.

OpenStudio exists because broadcasting should not require permission. No account creation. No monthly invoice. No terms of service between you and your audience. You clone a repo, you start broadcasting. Your station runs on your hardware. Your audio never touches a server you don't control.

This is a broadcast studio built the way radio was meant to work — direct, unmediated, yours. Vanilla JavaScript. Web Audio API. No framework, no build step, no dependency you didn't choose. Self-host it on a Raspberry Pi, a $5 VPS, a closet server at the back of your hackerspace. The entire client is under 50KB. If you can run Node, you can run a station — rent free.

Connect guests over WebRTC mesh. Mix-minus gives every participant broadcast-quality monitoring — the same technique used in professional studios, now running in a browser tab. Stream to unlimited listeners through Icecast. Record every voice on its own track for post-production. No platform stands between your signal and the world.

Quick Start

git clone https://github.com/msitarzewski/openstudio.git
cd openstudio && npm install && npm start
# Open http://localhost:6736

One command. One process. One port.

Why OpenStudio?

	OpenStudio	Riverside	Zencastr	StreamYard
Price	Free / self-host	$29/mo	$20/mo	$25/mo
Recording	Per-track WAV + mix	Per-track	Per-track	Mix only
Self-hosted	Yes	No	No	No
Privacy	Zero tracking	Cloud-dependent	Cloud-dependent	Cloud-dependent
Max participants	15 (mesh)	8	15	10
Setup time	30 seconds	Account + payment	Account + payment	Account + payment
Open source	MIT	No	No	No

How It Works

flowchart TB
    subgraph Browsers["Browsers — peer-to-peer over WebRTC mesh"]
        direction LR
        B1["🎙️ Host"]
        B2["🎙️ Caller"]
        B3["🎙️ Caller"]
        B1 <--> B2
        B2 <--> B3
        B1 <--> B3
    end

    subgraph Studio["Inside each browser"]
        WebAudio["Web Audio Graph<br/>Mix-Minus + Program Bus"]
        Rec["MediaRecorder<br/>per-track + program mix"]
        UI["Capability-gated UI<br/>modal explains missing prereqs"]
    end

    subgraph Signaling["Signaling Server (Node, ~6736)"]
        WS["WebSocket<br/>JWT rooms · RBAC · rate limits"]
        Static["Static files + listener proxy"]
        Caps["/api/capabilities"]
        Export["/api/export/<br/>clean · zip · transcribe · show-notes"]
    end

    subgraph AI["Optional AI Tooling — capability-gated"]
        FF["ffmpeg<br/>silencedetect · loudnorm · MP3"]
        Whisper["whisper.cpp<br/>on-device transcription"]
        LLM["OpenAI-compatible LLM<br/>LM Studio · Ollama · OpenAI · Groq"]
    end

    Icecast[("Icecast<br/>broadcast to listeners")]

    Browsers <-->|signaling| WS
    Browsers --> WebAudio
    WebAudio --> Rec
    UI -.reads.-> Caps
    Rec -->|upload| Export
    Export --> FF
    Export --> Whisper
    Export --> LLM
    WebAudio -->|live source| Icecast
    Static -->|/stream/*| Icecast
    Icecast --> Listeners[["📻 Listeners<br/>(unlimited, anywhere)"]]

Mix-minus is a broadcast engineering standard — each participant hears everyone except themselves. No echo, no feedback. Professional studios have done this with hardware for decades. OpenStudio does it in the browser with the Web Audio API. Everything outside the green AI box is required; the AI tooling is optional and the UI tells you exactly what's missing if you click a gated feature.

Features

Broadcast core

WebRTC mesh — peer-to-peer audio, no media server, up to ~15 participants
Mix-minus per participant — O(N) phase-inversion, broadcast-standard monitoring with zero echo
Per-participant gain, mute, and segmented LED level meters with waveform oscilloscope
Microphone input selector — enumerates all OS audio devices
Icecast streaming — broadcast to unlimited listeners through your own domain
WebSocket streaming fallback for Safari (browsers without ReadableStream upload)
Listener proxy at /stream/* so the entire app runs on a single port

Recording & post-production

Multi-track recording — per-voice WAV/WebM + program mix, all captured client-side
Single-zip bundle download for the whole session
Audio cleaning pipeline — silence detection, filler-word splice, two-pass loudness normalization to −16 LUFS (requires ffmpeg + whisper.cpp for the splice transcript)
Raw or Clean export with WAV / WebM / MP3 (podcast-ready) output (MP3 requires ffmpeg)

Optional AI tooling — capability-gated, runs on your hardware, see setup below

On-device transcription via whisper.cpp — no cloud API, no third party (requires whisper.cpp + Whisper model + ffmpeg)
LLM-generated show notes — episode title, summary, timestamped segment markers (requires transcription + any OpenAI-compatible LLM endpoint)
Markdown export — copy to clipboard or download as .md
Gated features show as disabled with an info icon and a setup modal — never a cryptic error

Security & ops

JWT room tokens (24 h) and scoped invite tokens (4 h)
Role-based access — host, ops, guest (caller) with producer-authoritative mute
256 KB WebSocket message cap, per-IP connection limit, sliding-window rate limits
CORS allowlist (open by default for dev), hardened security headers, sanitized listener proxy
Self-hosted variable fonts (Inter, JetBrains Mono, Space Grotesk) — no Google Fonts CDN dependency

Zero dependencies in the browser — vanilla JavaScript, Web Audio API, no framework, no build step. The entire client is under 50 KB.

Optional AI Tooling

OpenStudio includes a complete post-production pipeline — transcription, show notes, MP3 export. It's optional: broadcasting, recording, per-track downloads, and zip bundles all work without any of this. AI features ship enabled by default but are capability-gated — when a prerequisite is missing, the button shows as disabled with an info icon. Click it and a modal explains exactly what to install. Nothing is removed from the UI, nothing fails with a cryptic stack trace.

Prerequisites

ffmpeg and ffprobe on your PATH (most package managers ship both: brew install ffmpeg, apt install ffmpeg)
~1.5 GB free disk space for the default Whisper model

whisper.cpp setup (one-time)

git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp && make -j$(nproc)
cd .. && mkdir -p models
wget -O models/ggml-medium.bin https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-medium.bin

You can swap the model — ggml-tiny.bin (~75 MB) is faster but less accurate; ggml-large.bin is the other direction. The server looks for models/ggml-medium.bin by default.

LLM Provider Examples

Show notes call any OpenAI-compatible chat completions API. Configure via .env. Local providers need no API key; cloud providers require LLM_API_KEY.

LM Studio (default — nothing to configure):

# LM Studio runs at localhost:1234 by default; OpenStudio assumes this.
# Just start LM Studio and load a chat model. No env vars required.

Ollama (with the OpenAI-compatible shim):

LLM_BASE_URL=http://localhost:11434/v1
LLM_MODEL=llama3.3

OpenAI:

LLM_BASE_URL=https://api.openai.com/v1
LLM_MODEL=gpt-4o-mini
LLM_API_KEY=sk-...

Together AI:

LLM_BASE_URL=https://api.together.xyz/v1
LLM_MODEL=meta-llama/Llama-3.3-70B-Instruct-Turbo
LLM_API_KEY=...

Groq:

LLM_BASE_URL=https://api.groq.com/openai/v1
LLM_MODEL=llama-3.3-70b-versatile
LLM_API_KEY=gsk_...

Anthropic — the direct Messages API is not OpenAI-compatible. Run a shim like anthropic-openai-compat or litellm in front of it, then point LLM_BASE_URL at the shim.

Feature Gating

AI features are visible in the UI by default. The server publishes a GET /api/capabilities snapshot reporting which prerequisites are actually present — ffmpeg, ffprobe, whisper.cpp binary, Whisper model file, configured LLM endpoint. The frontend reads this on load and disables any button whose prereqs are missing. Clicking a gated button opens a modal with the exact install commands. No hidden flags, no "feature available in pro" dark patterns — capability is derived from what your runtime can actually do.

What happens without AI setup

Transcribe, Show Notes, and MP3 export show as disabled until their prereqs are met — clicking them opens the install modal. Every other feature — live broadcast, recording, per-track download, zip bundle, Icecast streaming — works without any AI setup.

Try It

Open openstudio.zerologic.com
Enter a station name and click Start Broadcast
Allow microphone access when prompted
Share the invite URL with a co-host (or open it in a second browser tab)
Talk — you'll hear each other with zero echo thanks to mix-minus
Hit Record to capture per-voice tracks, then Download when done

Broadcasts auto-expire after 15 minutes on the demo. Self-host for unlimited airtime.

Architecture

For detailed architecture documentation, see docs/ARCHITECTURE-IMPLEMENTATION.md.

Stack: Node.js · WebSocket · WebRTC · Web Audio API · Icecast · coturn

Roadmap

0.1 — Core studio: WebRTC mesh, mix-minus, mute controls, Icecast streaming
0.2 — Single-server deploy, multi-track recording, live demo
0.2.1 — Security hardening: JWT room tokens, rate limiting, CORS, RBAC
0.3 — Power Move: Whisper.cpp transcription, audio cleaning pipeline, Clean/Raw export, self-hosted fonts
0.3.1 — MP3 export fix, single-zip bundle download, configurable LLM endpoint
0.4 — Invite-link UI, DHT station discovery, Nostr NIP-53 integration, Ed25519 station identities
0.5 — SFU for larger rooms (25+ participants), soundboard, text chat

Development

# Development mode with hot reload
npm run dev

# Run tests
npm test

# With Docker services (Icecast + coturn)
docker compose up -d
npm start

See docs/vision.md for the full project vision and philosophy.

Known Gaps

Honest about what's there and what isn't:

Invite-link UI — the server can mint scoped invite tokens (host / ops / guest, 4 h TTL), but the host UI doesn't expose a button yet. For now, hosts share the room URL manually. Coming in 0.4.
AI pipeline setup is manual — the whisper.cpp build, model download, and LLM configuration aren't scripted. A setup-ai.sh would be a great PR.
whisper.cpp gitlink — the repo references whisper.cpp as a gitlink without a .gitmodules entry. Use the manual git clone in the AI setup section above; git submodule update will fail.
Mesh scale ceiling — WebRTC mesh tops out around 15 participants. Larger rooms need an SFU (planned for 0.5).

Contributing

PRs welcome! Please read the existing code before contributing — the codebase is intentionally minimal.

Fork the repo
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes
Push to the branch
Open a Pull Request

Sponsor

OpenStudio is free, open-source, and built by independent developers. If it's useful to you, sponsor the project on GitHub to keep it that way.

License

MIT — use it however you want.

talk hard.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.claude		.claude
.devcontainer		.devcontainer
.github		.github
deploy		deploy
docs		docs
icecast		icecast
memory-bank		memory-bank
server		server
shared		shared
tests		tests
web		web
whisper.cpp		whisper.cpp
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
README.md		README.md
claude.md		claude.md
dev-switch.sh		dev-switch.sh
dev.sh		dev.sh
docker-compose.yml		docker-compose.yml
package.json		package.json
run-pre-validation.sh		run-pre-validation.sh
station-manifest.sample.json		station-manifest.sample.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenStudio

Quick Start

Why OpenStudio?

How It Works

Features

Optional AI Tooling

Prerequisites

whisper.cpp setup (one-time)

LLM Provider Examples

Feature Gating

What happens without AI setup

Try It

Architecture

Roadmap

Development

Known Gaps

Contributing

Sponsor

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenStudio

Quick Start

Why OpenStudio?

How It Works

Features

Optional AI Tooling

Prerequisites

whisper.cpp setup (one-time)

LLM Provider Examples

Feature Gating

What happens without AI setup

Try It

Architecture

Roadmap

Development

Known Gaps

Contributing

Sponsor

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages