(TODO) Audio

How to get an audio file

  • long
  • short
  • sync
    • audio
    • timecode with text
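
The "timecode with text" item above could be stored as a list of timed text cues. The notes do not name a format, so the sketch below assumes SubRip (SRT) style timecodes (`HH:MM:SS,mmm`). `Cue`, `format_timecode` and `to_srt` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One piece of text tied to a time range in the audio (hypothetical type)."""
    start_ms: int
    end_ms: int
    text: str


def format_timecode(ms: int) -> str:
    """Format milliseconds as an SRT-style timecode, HH:MM:SS,mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt(cues: list[Cue]) -> str:
    """Render cues as numbered SRT blocks separated by blank lines."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        timing = f"{format_timecode(cue.start_ms)} --> {format_timecode(cue.end_ms)}"
        blocks.append(f"{index}\n{timing}\n{cue.text}\n")
    return "\n".join(blocks)
```

Keeping times as integer milliseconds avoids floating-point drift when cues are later aligned against the audio track.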

AI

The services for SFX and Music generally fall into two categories: Library/Search (for existing assets) and Generative (for creating new assets).

The table below combines API availability for both search/library and generative functionality across these services.

| Service | Primary Category | API for Generation | API for Search/Library | Notes |
|---|---|---|---|---|
| Freesound.org | SFX / Library | No | Yes | Offers a comprehensive API for searching, browsing, and retrieving sound effects based on keywords and content analysis. |
| Unity Asset Store | SFX & Music / Library | No | Yes (In-Editor) | Search functionality is accessible programmatically within the Unity Editor using its specialized search provider. |
| ElevenLabs | Voice/SFX / Generative | Yes | No (Library) | Primarily a Text-to-Speech (TTS) and Text-to-SFX generation API. No API for searching a public library of existing SFX. |
| Audo.ai | Audio Processing | No | No | The API is for noise removal and speech enhancement, not for generating or searching a music/SFX library. |
| AIVA | Music / Generative | Yes | Unconfirmed | Focused on generating music based on parameters. API access is primarily for generation. Public search API is not explicitly documented. |
| Soundraw | Music / Generative | Yes | Yes (By Criteria) | API for generating original music based on parameters (genre, mood, etc.). This acts as a search by criteria for original content. |
| Mubert | Music / Generative & Library | Yes | Yes | Offers both a generative API (Text-to-Music, Image-to-Music) and access to a curated library that can be filtered by genre, mood, or BPM. |
| Amazon Polly | Voice / Generative | Yes | No | Text-to-Speech (TTS) API. No API for searching a music/SFX library. |
| Google Gemini/TTS | Voice / Generative | Yes | No | Text-to-Speech (TTS) via the Gemini API or Cloud TTS. No API for searching a music/SFX library. |

Key Takeaways:

  • For finding existing SFX/sounds, Freesound.org is the primary service with a search API.
  • For generative music that acts as a search-by-criteria, Soundraw and Mubert both offer APIs.
  • Mubert is the only service listed that offers both music generation and search/filtering over a curated library of existing tracks.
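
As a rough illustration of the Freesound search API mentioned in the takeaways, the sketch below builds a text-search request using only the standard library. It assumes the APIv2 text-search endpoint (`/apiv2/search/text/`) with token authentication and the `query`, `fields` and `page_size` parameters; check these against the current Freesound API documentation before relying on them. `build_search_url` and `search_sounds` are hypothetical helper names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint for Freesound APIv2 text search; verify against the official docs.
FREESOUND_SEARCH_URL = "https://freesound.org/apiv2/search/text/"


def build_search_url(query, token, page_size=5,
                     fields=("id", "name", "license", "previews")):
    """Build the search URL without performing any network request."""
    params = {
        "query": query,
        "token": token,              # personal API key from a Freesound account
        "page_size": page_size,
        "fields": ",".join(fields),  # limit the response to the fields we need
    }
    return f"{FREESOUND_SEARCH_URL}?{urlencode(params)}"


def search_sounds(query, token, **kwargs):
    """Run the search and return the list of result dicts (needs network access)."""
    with urlopen(build_search_url(query, token, **kwargs), timeout=10) as response:
        return json.load(response)["results"]
```

Calling `search_sounds("door slam", token)` with a valid API key should return a short list of matching sounds. Keeping URL construction separate from the request makes the query easy to inspect or log without touching the network.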

How to Convert Text to Speech in Python

  • gTTS
  • pyttsx3
  • openai
  • Hugging Face Transformers
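
Two of the libraries listed above can be sketched side by side: gTTS (online, sends the text to Google Translate's TTS service) and pyttsx3 (offline, uses the speech engine installed on the operating system). This is a minimal sketch, assuming both packages are installed via pip. `speak_online`, `speak_offline` and `pick_backend` are hypothetical names, and the imports are deferred so the module loads even if only one library is present.

```python
from pathlib import Path


def speak_online(text, out_path="speech.mp3", lang="en"):
    """Synthesize speech with gTTS (requires network access); writes an MP3."""
    from gtts import gTTS  # third-party: pip install gTTS
    gTTS(text=text, lang=lang).save(str(out_path))
    return Path(out_path)


def speak_offline(text, out_path="speech.wav"):
    """Synthesize speech with pyttsx3 using the local OS speech engine."""
    import pyttsx3  # third-party: pip install pyttsx3
    engine = pyttsx3.init()
    engine.save_to_file(text, str(out_path))
    engine.runAndWait()  # blocks until the queued file has been written
    return Path(out_path)


def pick_backend(online: bool):
    """Return the synthesis function matching the online/offline choice."""
    return speak_online if online else speak_offline
```

For example, `pick_backend(online=False)("Hello world")` would write the audio using the local engine. The actual file format produced by pyttsx3 depends on the platform's speech driver, so the `.wav` extension here is an assumption.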

TTS

Online

Offline