Skip to content
@linto-ai

linto.ai

Your Open Source end-to-end platform for voice-operated solutions

LinTO AI

Open Source Ecosystem for Transcription, Collaborative Media Management, Annotation, Live Subtitling, and Summarization

LinTO AI Banner

Overview

LinTO AI provides a powerful suite of open-source tools for transcription, collaborative media editing, annotation, live subtitling, and summarization utilizing large language models (LLMs).

Hosted by
LINAGORA
Try LinTO Studio

Quick Start

  • LinTO Studio: 🎤 A media management platform offering advanced tools for transcription and collaborative media editing. Key features include:
    • Speaker identification/diarization: Automatically segment and identify speakers.
    • Automatic timestamp alignment: Synchronize transcripts with media.
    • Collaborative editing: Work collaboratively on media annotations and transcriptions in real-time.
    • Summarization: Generate concise summaries of media content using LLMs.
    • Building and syncing subtitles: Create and synchronize subtitles for video content with ease.
    • Live transcription from the browser: Record and transcribe audio directly from your browser.
    • AI Agent for videoconferences: A bot system that joins videoconferences to capture live audio streams for transcription and subtitling. This allows LinTO Studio to act as a powerful assistant during meetings, leveraging videoconference platforms as live audio sources.

LinTO Studio leverages our other technologies, including:

  • LinTO-STT for speech-to-text conversion.
  • LinTO-Live-Plugins designed to operate transcription on inbound audiovisual streams (SRT, RTMP, WS...).
  • LinTO-Diarization for speaker segmentation and identification.
  • LLM-Gateway for advanced summarization.

To deploy LinTO Studio and its associated services, use the LinTO Deployment Tool, which simplifies the setup process. Learn more

Key Projects

  • LinTO-STT: 🗣️ An automatic speech recognition API supporting both offline and real-time transcriptions. It accommodates models like Kaldi and Whisper and can operate as a standalone service or within a microservices infrastructure. Learn more

  • LinTO-Studio-PLugins: Designed to operate and manage, at scale, transcription sessions from inbound audiovisual streams. Particularly in enterprises or structures managing multiple meeting rooms, whether physical or virtual. The project connects multiple automatic speech recognition (ASR) providers to enable transcription of multilingual meetings. Its primary objective is to provide users with live closed captions and the ability to download transcripts of past sessions. In other words, the project bridges audio streams, with SRT streams as a first-class citizen, to ASR providers and manages transcripts, including real-time delivery on screen and downloadable artifacts. Learn more

  • Whisper-Timestamped: ⏱️ A multilingual automatic speech recognition tool providing word-level timestamps and confidence scores. It enhances OpenAI's Whisper models to deliver more precise transcriptions with detailed timing information. Learn more

  • LLM-Gateway: 📝 A service dedicated to rolling summarization using large language models (LLMs), enabling efficient processing and summarization of extensive textual data. Learn more

  • LinTO-Diarization: 🔊 A speaker diarization service that segments audio streams into homogeneous segments based on speaker identity, with capabilities for speaker identification when audio samples of known speakers are provided. Learn more

  • WebVoiceSDK: 🌐 A JavaScript library offering lightweight and optimized building blocks for always-listening voice-enabled applications directly in the browser. It manages various aspects of voice input, including hardware microphone handling, voice activity detection, and wake word detection. Learn more

Get Involved

LinTO AI is committed to open-source development, ensuring our tools are accessible and adaptable, fostering innovation in business-aware media transcription and summarization. For more information or to contribute, contact us at [email protected].

Pinned Loading

  1. linto-stt linto-stt Public

    An automatic speech recognition API

    Python 71 19

  2. linto-studio linto-studio Public

    Transcription and annotation interface for recorded audio or video files

    JavaScript 42 4

  3. whisper-timestamped whisper-timestamped Public

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Python 2.6k 201

Repositories

Showing 10 of 51 repositories

Top languages

Loading…

Most used topics

Loading…