Preferences

Open the Preferences window from the menu bar, or press Ctrl/Cmd + ,.

General Preferences

OpenAI API preferences

API Key - The key used to authenticate your requests to the OpenAI API. To get an API key from OpenAI, see this article.

Base Url - By default, all requests are sent to the API provided by OpenAI, whose URL is https://api.openai.com/v1/. Other companies also provide compatible APIs. A list of available API URLs can be found on the discussion page.
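Before entering a key and base URL in Buzz, it can help to check them from a terminal. The sketch below is only an illustration: models_url is a hypothetical helper, OPENAI_API_KEY is an assumed variable name for holding your key, and it relies on the model listing endpoint that OpenAI serves under its base URL. A compatible provider may or may not implement that endpoint.

```shell
#!/bin/bash
# models_url BASE_URL
# Hypothetical helper: joins a base URL and the "models" path,
# tolerating one trailing slash on the base URL.
models_url() {
  local base="${1%/}"
  printf '%s/models\n' "$base"
}

base_url="https://api.openai.com/v1/"

# Only contact the API when a key is available in OPENAI_API_KEY
# (an assumed variable name, not something Buzz reads).
if [ -n "${OPENAI_API_KEY:-}" ]; then
  curl --silent --show-error --fail \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    "$(models_url "$base_url")"
fi
```

If the request succeeds, the same base URL and key should be reasonable candidates for the two fields above.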

Default export file name

Sets the default export file name for file transcriptions. For example, a value of {{ input_file_name }} ({{ task }}d on {{ date_time }}) will save TXT exports as Input Filename (transcribed on 19-Sep-2023 20-39-25).txt by default.

Available variables:

| Key | Description | Example |
| --- | --- | --- |
| input_file_name | File name of the imported file | audio (if the imported file path was /path/to/audio.wav) |
| task | Transcription task | transcribe, translate |
| language | Language code | en, fr, yo, etc. |
| model_type | Model type | Whisper, Whisper.cpp, Faster Whisper, etc. |
| model_size | Model size | tiny, base, small, medium, large, etc. |
| date_time | Export time (format: %d-%b-%Y %H-%M-%S) | 19-Sep-2023 20-39-25 |
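To see how the variables combine, the following sketch imitates the default template {{ input_file_name }} ({{ task }}d on {{ date_time }}) in a shell script. It is not how Buzz renders the template internally; render_export_name is a hypothetical helper, and the month abbreviation printed by date depends on your locale.

```shell
#!/bin/bash
# render_export_name INPUT_FILE_NAME TASK DATE_TIME
# Hypothetical helper that mimics the default export template.
# Note how the literal "d" after the task turns "transcribe" into
# "transcribed" and "translate" into "translated".
render_export_name() {
  local input_file_name="$1" task="$2" date_time="$3"
  printf '%s (%sd on %s)\n' "$input_file_name" "$task" "$date_time"
}

# Derive input_file_name from a path: drop the directory and the extension.
path="/path/to/audio.wav"
base="$(basename "$path")"
input_file_name="${base%.*}"

# date_time uses the same format string as the table: %d-%b-%Y %H-%M-%S
render_export_name "$input_file_name" "transcribe" "$(date '+%d-%b-%Y %H-%M-%S')"
```

The exported file would then carry this name plus the extension of the chosen export format, such as .txt.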

Live transcript exports

Live transcription export can be used to integrate Buzz with other applications like OBS Studio. When enabled, live text transcripts will be exported to a text file as they get generated and translated.

If AI translation is enabled for live recordings, the translated text will also be exported to a text file. The filename for the translated text will end with .translated.txt.

Live transcription mode

Three transcription modes are available:

Append below - New sentences are added below the existing ones, with an empty space between them. The most recent sentence is at the bottom.

Append above - New sentences are added above the existing ones, with an empty space between them. The most recent sentence is at the top.

Append and correct - New sentences are added at the end of the existing transcript without extra space between them. This mode also tries to correct errors at the end of previously transcribed sentences. It requires more processing power, so it needs more powerful hardware.
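An application that reads the exported file needs to know where the newest sentence is, which depends on the mode. The sketch below shows one way to pick it out for the first two modes. latest_sentence is a hypothetical helper, and it assumes each sentence sits on its own line with blank lines in between, which may not match the exact file layout Buzz produces.

```shell
#!/bin/bash
# latest_sentence FILE MODE
# Hypothetical helper: prints the most recent sentence of a live export.
# Assumption: one sentence per line, separated by blank lines.
#   below -> newest sentence is the last non-empty line
#   above -> newest sentence is the first non-empty line
latest_sentence() {
  local file="$1" mode="$2"
  case "$mode" in
    below) awk 'NF { last = $0 } END { if (last != "") print last }' "$file" ;;
    above) awk 'NF { print; exit }' "$file" ;;
    *) echo "unknown mode: $mode" >&2; return 1 ;;
  esac
}
```

For example, latest_sentence /path/to/export.txt below would print the bottom sentence of a file written in Append below mode; the path is a placeholder for whatever export file you configured.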

Advanced Preferences

To keep the preferences section simple for new users, some more advanced preferences can only be set via OS environment variables. Set the necessary environment variables in your OS before starting Buzz, or create a script that sets them.

On macOS and Linux, create run_buzz.sh with the following content:

#!/bin/bash
export VARIABLE=value
export SOME_OTHER_VARIABLE=some_other_value
buzz

On Windows, create run_buzz.bat with the following content:

@echo off
set VARIABLE=value
set SOME_OTHER_VARIABLE=some_other_value
"C:\Program Files (x86)\Buzz\Buzz.exe"

Alternatively, you can set environment variables in your OS settings. See this guide or this video for more information.

Available variables

BUZZ_WHISPERCPP_N_THREADS - Number of threads to use for the Whisper.cpp model. Default is 4.

On a laptop with 16 threads, setting BUZZ_WHISPERCPP_N_THREADS=8 leads to a speedup of roughly 15% in transcription time. Increasing the number of threads further slows transcription down, because the results from parallel threads have to be combined to produce the final answer.
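As a starting point, a launch script could derive the thread count from the machine it runs on. The sketch below uses half of the logical CPUs, which is only a heuristic extrapolated from the single 16-thread measurement above; pick_threads is a hypothetical helper, and the best value for your hardware may differ.

```shell
#!/bin/bash
# pick_threads LOGICAL_CPUS
# Hypothetical helper: returns half of the logical CPU count, at least 1.
pick_threads() {
  local logical="$1"
  local half=$(( logical / 2 ))
  if [ "$half" -lt 1 ]; then half=1; fi
  echo "$half"
}

# getconf reports the number of online logical CPUs on Linux and macOS.
# If it is unavailable, fall back to 8 so the result equals the default of 4.
logical_cpus="$(getconf _NPROCESSORS_ONLN 2>/dev/null || echo 8)"

export BUZZ_WHISPERCPP_N_THREADS="$(pick_threads "$logical_cpus")"
echo "BUZZ_WHISPERCPP_N_THREADS=$BUZZ_WHISPERCPP_N_THREADS"
```

Placing these lines in run_buzz.sh before the buzz command would apply the value on every start. Timing a short transcription with a few different values is the more reliable way to settle on a number.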

BUZZ_TRANSLATION_API_BASE_URL - Base URL of the OpenAI-compatible API to use for translation.

BUZZ_TRANSLATION_API_KEY - API key of the OpenAI-compatible API to use for translation.

BUZZ_MODEL_ROOT - Root directory to store model files. Defaults to user_cache_dir.

BUZZ_FAVORITE_LANGUAGES - Comma-separated list of supported language codes to show at the top of the language list.

BUZZ_LOCALE - Buzz UI locale to use. Defaults to one of the supported system locales.

BUZZ_DOWNLOAD_COOKIEFILE - Location of a cookie file to use for downloading private videos or as a workaround for anti-bot protection.

BUZZ_FORCE_CPU - Forces Buzz to use the CPU instead of the GPU. Useful on setups where an older GPU is slower than the CPU or where the GPU has issues. Example usage: BUZZ_FORCE_CPU=true. Available since 1.2.1.

BUZZ_MERGE_REGROUP_RULE - Custom regroup merge rule to use when combining transcripts with word-level timings. More information on the available options can be found in the stable-ts repo. Available since 1.3.0.
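Putting a few of these together, a filled-in launch script might look like the sketch below. The values are examples only: the language codes come from the table of export variables, the model directory is a hypothetical path, and the list format without spaces after the comma is an assumption.

```shell
#!/bin/bash
# Example values only; adjust them for your system.
export BUZZ_FORCE_CPU=true
export BUZZ_FAVORITE_LANGUAGES=en,fr        # assumed format: no spaces
export BUZZ_MODEL_ROOT="$HOME/buzz-models"  # hypothetical directory

# Start Buzz only if the command is available on PATH.
if command -v buzz >/dev/null 2>&1; then
  buzz
else
  echo "buzz not found on PATH" >&2
fi
```

Variables exported this way apply only to Buzz instances started from the script, so starting Buzz from a desktop shortcut will not pick them up.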