AudioSpectrogram

AudioSpectrogram is a powerful audio spectrogram generator that supports various audio formats and produces high-quality spectrograms. Built with Rust, it offers cross-platform support and runs on Windows, macOS, and Linux.

Features

Cross-platform support (Windows, macOS, Linux)
Drag-and-drop support on Windows
Multiple audio format support: WAV, MP3, FLAC, OGG, AAC, etc.
High-quality spectrogram generation using Turbo colormap
Automatic multi-channel audio processing (mixed to mono)
Precise time and frequency scales
Complete dB scale display (-120dB to 0dB)
Customizable FFT size and hop size

Sample Spectrogram

This is a sample spectrogram generated using our tool, showing OneRepublic's "Apologize" (44.1kHz sampling rate). The spectrogram clearly demonstrates:

Full frequency range (0-22.05kHz)
Clear time axis markers
Precise frequency scaling
Rich dynamic range display (-120dB to 0dB)

Requirements

Rust toolchain (recommended installation via rustup)
Cargo (Rust package manager, included with Rust)
System requires at least one monospace font:
- Windows: Consolas
- macOS: Monaco
- Linux: DejaVu Sans Mono

Building

Clone the repository:

git clone https://github.com/lmshao/AudioSpectrogram.git
cd AudioSpectrogram

Build the project:

cargo build --release

The executable will be available in the target/release directory.

Usage

Basic usage:

AudioSpectrogram -i input.mp3

On Windows, you can simply drag and drop an audio file onto the program icon, and it will automatically generate a spectrogram. This is the easiest way to use the program.

Alternatively, specify the file directly in the command line:

AudioSpectrogram input.mp3

Command Line Arguments

-i, --input <FILE>: Input audio file path
-o, --output <FILE>: Output image path (optional, defaults to input filename with .png extension)
-f, --fft-size <SIZE>: FFT size (optional, default: 4096)
-p, --hop-size <SIZE>: Hop size (optional, default: half of FFT size)

Examples

Generate spectrogram with default parameters:

AudioSpectrogram -i music.flac

Specify output filename:

AudioSpectrogram -i music.flac -o spectrum.png

Custom FFT parameters:

AudioSpectrogram -i music.flac -f 8192 -p 2048

Output Description

The generated spectrogram includes:

Vertical axis: Frequency scale (kHz)
Horizontal axis: Time scale (min:sec)
Right side: dB scale (-120dB to 0dB)
Color mapping: Using Turbo colormap, red indicates high intensity, blue indicates low intensity

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
resources		resources
src		src
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
build.rs		build.rs
resources.rc		resources.rc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AudioSpectrogram

Features

Sample Spectrogram

Requirements

Building

Usage

Command Line Arguments

Examples

Output Description

License

About

Uh oh!

Releases 1

Packages

Languages

License

lmshao/AudioSpectrogram

Folders and files

Latest commit

History

Repository files navigation

AudioSpectrogram

Features

Sample Spectrogram

Requirements

Building

Usage

Command Line Arguments

Examples

Output Description

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages