Skip to content

The Real-Time Speech Translation project demonstrates a Python application for real-time speech transcription and translation using machine learning and natural language processing (NLP) techniques. It provides a seamless interface for users to speak in one language and instantly see translations in English, enhancing multilingual communication

Notifications You must be signed in to change notification settings

AYUSHI-SHA/Speech_Translation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Real-Time Speech Translation

Overview

This project demonstrates a real-time speech translation application using Python and various libraries.

Dependencies

  • speech_recognition: Python library for speech recognition
  • googletrans: Python wrapper for Google Translate API
  • nltk: Natural Language Toolkit for NLP tasks
  • numpy: Library for numerical computations
  • pandas: Library for data manipulation and analysis
  • Other standard Python libraries

Key Features

  • Speech Recognition: Utilizes the Google Web Speech API through speech_recognition library to transcribe spoken words into text.
  • Language Translation: Integrates googletrans library to translate recognized text into multiple target languages.
  • Dynamic Language Selection: Supports a wide range of languages for both source and target translations.
  • Ambient Noise Adjustment: Enhances accuracy by adjusting for ambient noise levels during speech recognition.

Technologies Used

  • Python: Used for application development.
  • Libraries: speech_recognition, googletrans, numpy, pandas, re, nltk, sklearn.
  • Integrated Development Environment (IDE): Used Visual Studio Code

Setup

  1. Clone the repository:
  2. git clone https://github.com/your-username/real-time-speech-translation.git
  3. Install dependencies:
  4. pip install -r requirements.txt
  5. Run the application:
  6. python transcribe_translate.py

Usage

Upon running the application, follow the prompts to select the source language and begin speaking for real-time translation.

License

This project is licensed under the MIT License - see the LICENSE file for details.


Generated by README generator v1.0

About

The Real-Time Speech Translation project demonstrates a Python application for real-time speech transcription and translation using machine learning and natural language processing (NLP) techniques. It provides a seamless interface for users to speak in one language and instantly see translations in English, enhancing multilingual communication

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages