Spraak

Spraak is a lightweight, browser-based voice recorder and transcription app powered by Google's Gemini AI. Record audio directly from your microphone, and Spraak will transcribe your speech to text in seconds — no server required.

🌐 Live demo: https://gpx-go-78518.web.app/

Features

🎙️ In-browser recording — captures audio from your microphone using the MediaRecorder API
🤖 AI transcription — sends recordings to Google Gemini for accurate speech-to-text conversion
📋 One-click copy — copies the transcript to your clipboard instantly
🌓 Dark/light mode toggle — switch themes to match your preference
🔒 Private by default — your Gemini API key is stored only in your browser's localStorage
🚫 No build step — a single index.html file; open it and go

Prerequisites

A modern browser with support for the MediaRecorder API and Web Audio API (Chrome, Edge, Firefox, Safari 14.1+)
A free Google Gemini API key

Getting Started

1. Set your API key

Click the ⚙ (gear) button in the top-right corner and choose Set API Key. Enter your Gemini API key when prompted. The key is saved in localStorage and never leaves your browser except when calling the Gemini API directly.

2. Record

Click Start Recording and speak. The status indicator in the header changes from idle (amber) to live (green) while recording is active.

3. Stop and transcribe

Click Stop Recording. Spraak converts the captured audio to WAV format and sends it to the Gemini API. The transcribed text appears in the text area below the controls.

4. Use the transcript

Copy — copies all text to the clipboard
Clear Text — empties the text area so you can start fresh

How It Works

The browser captures audio via navigator.mediaDevices.getUserMedia and buffers it with MediaRecorder.
When recording stops, the raw audio blob (WebM/MP4 from the browser) is decoded with the Web Audio API and re-encoded as a WAV file in pure JavaScript.
The WAV data is Base64-encoded and sent as an inline payload to the Gemini generateContent endpoint.
Gemini returns a plain-text transcript, which is appended to the text area.

Running Locally

No build tools are required. Open index.html directly in a browser:

# With Python's built-in HTTP server (avoids some browser security restrictions on file:// URLs):
python3 -m http.server 8080
# Then open http://localhost:8080 in your browser

Deployment

Because browsers require a secure context to access the microphone, Spraak must be served over HTTPS. Deploy the files (index.html, manifest.webmanifest, and the icons/ folder) to any static web host that provides HTTPS — for example GitHub Pages, Netlify, Vercel, Cloudflare Pages, or your own server with a TLS certificate.

⚠️ Serving over plain http:// (except localhost) will cause the browser to block microphone access.

Tech Stack

Layer	Technology
UI	Vanilla HTML / CSS / JavaScript
AI	Google Gemini API (`gemini-3-flash-preview`)
PWA	Web App Manifest
CI/CD	GitHub Actions + Firebase Hosting

License

This project does not currently include a license file. Please contact the repository owner for usage terms.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
.github		.github
icons		icons
.firebaserc		.firebaserc
.gitignore		.gitignore
README.md		README.md
app.js		app.js
firebase.json		firebase.json
index.html		index.html
manifest.webmanifest		manifest.webmanifest
styles.css		styles.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spraak

Features

Prerequisites

Getting Started

1. Set your API key

2. Record

3. Stop and transcribe

4. Use the transcript

How It Works

Running Locally

Deployment

Tech Stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Spraak

Features

Prerequisites

Getting Started

1. Set your API key

2. Record

3. Stop and transcribe

4. Use the transcript

How It Works

Running Locally

Deployment

Tech Stack

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages