AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Features

  • MusicGen for text-to-music with optional melody conditioning
  • AudioGen for text-to-sound effects and ambient audio
  • EnCodec neural audio codec for discrete tokenization and efficient modeling
  • Ready-to-use checkpoints and straightforward Python/CLI inference
  • Training recipes and scripts for fine-tuning on custom datasets
  • Example notebooks and utilities for prompting, conditioning, and post-processing

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow AudioCraft

AudioCraft Web Site

You Might Also Like
Monitor your whole IT Infrastructure Icon
Monitor your whole IT Infrastructure

Know what's up and what's new: Monitor all your systems, devices, traffic and applications.

Caters to tech staff, system Administrators, and companies of any size, from small and medium sized businesses to enterprises that need their IT network to be reliable and easy to monitor in real-time. Equipped with an easy-to-use, intuitive interface with a cutting-edge monitoring engine. PRTG optimizes connections and workloads as well as reducing operational costs by avoiding outages while saving time and controlling service level agreements (SLAs).
Start Your Free PRTG Trial Now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of AudioCraft!

Additional Project Details

Programming Language

Python

Related Categories

Python Sound Audio, Python Libraries, Python Deep Learning Frameworks

Registered

2025-10-06