Voice Translator with Gemini API

This is a web application that provides a seamless voice translation experience. It captures audio from the user's microphone, transcribes it, translates the text using the Google Gemini API, and can narrate the translated text back to the user.

✨ Key Features

Voice Transcription: Captures microphone input and transcribes speech to text.
Conversational Translation: Intelligently groups spoken phrases during natural pauses, sending them to the Gemini API for accurate, context-aware translation.
Text-to-Speech Narration: Reads the translated text aloud using the browser's built-in speech synthesis.
Translation History: Saves completed translation sessions for later review.
Multi-Language Support: Supports a wide range of source and target languages for both transcription and translation.
Bilingual UI: The application interface is available in both English and Spanish.

🛠️ Tech Stack

Frontend: React with TypeScript
AI Model: Google Gemini (gemini-2.5-flash) via @google/genai SDK
Build Toll: Vite
Web APIs: Web Speech API (for recognition) & Speech Synthesis API (for narration)
Styling: Tailwind CSS

🚀 Getting Started (Local Development)

Follow these instructions to get a copy of the project up and running on your local machine.

Prerequisites

Node.js (version 18 or newer recommended)
A modern web browser with support for the Web Speech API (Google Chrome is recommended).
A Google Gemini API Key. You can get one from Google AI Studio.

Installation & Setup

Clone the repository:

git clone https://github.com/your-username/voice-translator.git
cd voice-translator

Install dependencies:
```
npm install
```
Create the environment file: This project uses a .env.local file to manage the API key securely. Create this file in the root of the project:
```
touch .env.local
```
Add your API Key: Open the .env.local file and add your Google Gemini API key. It is crucial that the variable name starts with VITE_ for it to be accesible in the browser.
```
VITE_API_KEY=YOUR_GEMINI_API_KEY_HERE
```
Run the development server:
```
npm run dev
```
Then, open your browser and navigate to the URL provided by Vite (usually http://localhost:5173).

Note

I've been unable to separate the input and output audio in the mobile settings, nor in the browser app settings, such as this one. I haven't been able to access the PC audio or the headphone audio from the browser.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
components		components
hooks		hooks
public		public
services		services
.gitignore		.gitignore
App.tsx		App.tsx
LICENSE		LICENSE
README.md		README.md
REPORT.md		REPORT.md
constants.ts		constants.ts
index.css		index.css
index.html		index.html
index.tsx		index.tsx
metadata.json		metadata.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
types.ts		types.ts
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Translator with Gemini API

✨ Key Features

🛠️ Tech Stack

🚀 Getting Started (Local Development)

Prerequisites

Installation & Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Voice Translator with Gemini API

✨ Key Features

🛠️ Tech Stack

🚀 Getting Started (Local Development)

Prerequisites

Installation & Setup

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages