Glossary
This page gives very brief explanations of technical terms related to digital audio, with some links to Wikipedia for much more comprehensive explanations.
ToDo This or all table layouts may need to be customised to include horizontal lines. This page seems very tiring to look at without those.
This concept is being tested with prettytableb in common.css
In Opera, columns are drawn, not what we wanted... - Gale
Even after 1.4.0 if needs be, we must also have an Index. And when we do, I think these terms are part of it (because of the overlap between the Index to our pages and the Glossary items). -Gale
JC: I've updated Cepstrum and Noise Floor definitions, and think most of this editornote can now go. However there is scope (possibly) for saying more about noise floor and compressor, though possibly on the compressor page. With regard 'wavelength' we don't need a strict policy of removing glossary terms that currently aren't actually used in the text - it didn't have to be removed, but we don't have to reinstate it either.
The following two definitions need to be addressed. I believe someone more knowledgeable that myself should take a stab at these. - John
Cepstrum:
Some description here. It's like a logarithm of a Fourier Transform. We need a better description [1]
Noise Floor:
A level or amplitude representing the amount of noise present in the signal./// Gale: How do you distinguish in this description from noise that is above this level such as HF whine but which does not obliterate the other audio?
I guess I'm not sure what you are driving at. Can you point me to an Audacity page that discusses your point ? - John
Bill: Quoting wikipedia: "noise floor is the measure of the signal created from the sum of all the noise sources and unwanted signals within a measurement system. The noise floor limits the smallest measurement that can be taken with certainty since any measured amplitude can on average be no less than the noise floor."
In my experience, when recording engineers talk about noise floor this is what they are talking about. In their non-rigourous way they are using it interchangeably with signal to noise ratio. It's easier to say "the signal is below the noise floor" than to say "the level of the signal relative to 0 dB is below the signal to noise ratio".
Addressing your point, Gale, the HF whine would certainly be "noise" to a listener, and it is an 'unwanted signal', but because it can be identified by amplitude and frequency it is, I believe, above the noise floor - it can be measured.
The Compressor effect has a "noise floor" slider. In my understanding the purpose of this slider is to prevent the upward compression of noise. In effect it creates a second threshold or inflection point on the I/O transfer curve of the compressor. Given that the transfer curve below the main threshold gives upward compression, the curve below the "noise" threshold would provide downward expansion. The Leveller effect has a "threshold for noise" popup that appear to do the same thing. Thus I think that our definition of noise floor should at least help users understand the use of these controls.
So, I think John's definition is good. Technically you might want to replace 'signal' with 'system', but for our purposes I think that would be confusing.
Other Items formerly in this note have been incorporated below and original notes moved to Discussion Page.
Note that the glossary term wavelength isn't actually used anywhere in the manual. Definition Removed.
General Terms
| Term | Description | |
|---|---|---|
|
ADC: | Analog to digital converter. The part of a sound card which records an analog, real world sound like a voice or guitar and converts it to a numerical representation of the audio that a computer can manipulate. |
|
Algorithm: | A set of steps or a procedure that will produce a desired result. |
|
Amplitude: | The level or magnitude of a signal. Audio signals with a higher amplitude will sound louder. |
| Audacity Project Format (.aup): | The format in which Audacity stores its projects. This consists of a reference file with the extension .aup and a large number of small audio files with extension .AU. This structure makes it quicker for Audacity to move audio around - ideal for cutting and pasting audio in a project. | |
|
Audio CDs: | CDs containing PCM audio data in accordance with the Red Book standard. They can be played on any standalone CD player as well as on computers. |
|
Bit: | A measure of quantity of data. A bit is one binary digit, a 0 or a 1. |
|
Bit Rate: | The number of computer bits that are conveyed or processed per unit of time. Normally expressed in kilobits per second (kbps). |
|
CBR: | Constant Bitrate - The rate at which audio in this format uses its data does not vary. Silence uses as much 'space' as audible sound. |
|
Cepstrum: |
The cepstrum of an audio signal is related to the spectrum, but presents the rate of change in the different spectrum bands. It's particularly useful for properties of vocal tracks and is used, for example, in software to identify speakers by their voice characteristics. |
|
Clipping: | Distortion to sound that happens when the audio is too loud. When a waveform shows 'flat tops' rather than smooth curves it is usually an indication of clipping. |
|
Compressed Audio Format: | Any format that will reduce the space required in storing or representing an audio signal. Space savings can be made for example by discarding certain frequency components which may be inaudible. MP3 takes this approach. Other formats such as FLAC compress without audio loss, but achieve lower compression rates. |
|
Compression: | A process that tends to even out the overall volume level by increasing the level of softer passages and decreasing the level of louder passages. See also Compressed Audio Format. |
|
Cycle: | An audio tone consists of an oscillating sound pressure on the ear. One cycle is one full transition of positive pressure through to negative pressure, back to positive pressure again. |
|
DAC: | Digital to analog converter. The part of a sound card which plays back a numerical representation of audio as an analog, real world sound like a voice or guitar. |
|
Data CDs: | Data CDs contain data intended to be read directly by a computer. The data may include audio and any other types of file such as images and documents. Most standalone CD players will not play data CDs, but some DVD players will. Including compressed audio files on a data CD can greatly increase the playing time compared to audio CDs. |
|
dB: | Decibels. A logarithmic unit (typically of sound pressure) describing the ratio of that unit to a reference level. |
|
Dynamic Range: | The difference between the loudest and softest part in an audio recording, the maximum possible being determined by its sample format. For a device, the difference between its maximum possible undistorted signal and its Noise Floor. |
|
FFT: | Fast Fourier Transform. A method for performing Fourier transforms quickly. |
|
File name extension: | A suffix of three or four characters added to a file name which defines the format of its contents. The suffix is separated from the file name by a dot (period), as in "song.mp3". The extension of common formats is often hidden on Windows, but can be turned on in the system's Folder Options. |
|
Filter: | A sound effect that lets some frequencies through and suppresses others. |
|
Fourier Transform: | A method for converting a waveform to a spectrum, and back. |
|
Frequency: | Audio frequency determines the pitch of a sound. Measured in Hz (see below), higher frequencies have higher pitch. |
|
Gain: | How much to amplify the sound by. |
|
Interpolation: | Completing waveform data by estimating missing values. The values are estimated as being between other known values. To convert a waveform recorded at 22000 Hz or samples per second to one at a higher rate such as 44000 samples per second requires interpolation. |
|
Harmonics: | Most sounds are made up of a mix of different frequencies. In musical sounds, the component frequencies are simple multiples of each other, for example 100 Hz, 200 Hz, 300 Hz. These are called harmonics of the lowest frequency sound. |
|
High Pass Filter: | A filter that lets high frequencies through |
|
Hz: | Hertz. Measures a frequency event in number of cycles per second. See Frequency and Sample Rate, both of which are measured in Hz. |
|
LAME: | A software library that converts audio to MP3 format. |
|
Latency: | A short delay between an audio signal being sent and received. In computer audio this is due to analog-to-digital and digital-to-analog conversion. Most commonly refers to the delay between recording a sound and a) hearing its playthrough or b) laying it down on disk. |
|
Linear: | A simple, straight-forward, directly proportional, one-to-one relationship. A volume control would be a linear control. This term is used to contrast with logarithmic, or other complex relationships. |
|
Logarithmic: | A non-linear relationship where one item is proportional to the logarithm of the other item. Some measures, such as dB, are logarithmic by definition. |
|
Lossless: | A format for size-compressing audio that does not lose any information. The quality is exactly as good as before compression. An example is FLAC. |
|
Lossy: | A format for size-compressing audio that may sacrifice a small amount of quality in order to reduce the file size more than lossless compression. Examples are MP3 and OGG. |
|
Low Pass Filter: | A filter that lets low (bass) frequencies through. |
|
MP3 CDs: | A specific type of data CD containing only MP3 audio files. All computers can play them as can some DVD and portable MP3 players. |
|
Noise Floor: | A level or amplitude representing the amount of near-continuous background noise present in the signal. A background hiss would raise the noise floor, and could prevent a faint signal (one below the noise floor) being heard at all. Unwanted sporadic noise such as a member of the audience coughing is noise, but it doesn't contribute to the noise floor. |
|
PCM: | Pulse code modulation. A method of converting audio into binary numbers to represent it digitally, then back to audio. The waveform is measured at evenly spaced intervals and the amplitude of the waveform noted for each measurement. |
|
Pitch: | Generally synonymous with the fundamental frequency of a note, but in music, often also taken to imply a perceived measurement that can be affected by overtones above the fundamental. |
|
Red Book: | The most widely used standard for representing audio on CD, requiring stereo, 16-bit, 44100 Hz. |
|
RMS: | Root-mean-square. A method of calculating a numerical value for the average sound level of a waveform. |
|
Sample: | A discrete value at a point in a waveform representing the audio at that point. Also the act of taking a sequence of such values. All digital audio must be sampled at discrete points. By contrast, analog audio (such as the sound from a loudspeaker) is always a continuous signal. |
|
Sample Rate: | Measured in Hz like frequency, this represents the number of digital samples captured per second in order to represent the waveform. |
|
Sample Format: | Also known as Bit Depth or Word Size. The number of computer bits present in each audio sample. Determines the dynamic range of the audio. |
|
Spectrum: | Presentation of a sound in terms of its component frequencies. |
| Uncompressed Audio Format: | An audio format in which every sample of sound is represented by a binary number. Examples are WAV or AIFF. | |
|
VBR: | Variable bit rate. A method for compressing audio which does not always use the same number of bits to record the same duration of sound. |
|
Waveform: | A visual representation of an audio signal. |
Audio File Formats
- Description of FFmpeg formats (WMA, AC3, AAC, AMR NB)
- As we have now decided not to have separate pages for the different formats (only separate pages for format export options), we need to rethink where these Glossary entries link to - best left until we decide if we have an appendix containing details of "audio file formats" - Gale
- Ed 13 Nov 2009 consider adding MP4 (aka M4A) formats which are also supported (see wikipedia article)
| Term | Description | |
|---|---|---|
|
AIFF: | A container format, almost always used for lossless, uncompressed, PCM audio. The format is in Apple's Big-endian byte order. |
|
AU: | A container format, used by Audacity for storage of lossless, uncompressed, PCM audio data |
|
FLAC: | An Open Source lossless, size-compressed audio format |
|
MIDI: | MIDI is a small-sized file format which stores how to play notes, widely used for keyboard instruments. It is not an audio file format like WAV that uses thousands of samples to record the full sound of the notes actually being played. |
|
MP2: | A lossy, size-compressed audio format mainly used by the broadcast media |
|
MP3: | A lossy, size-compressed audio format which is the main format for transmitting audio over the internet |
|
Ogg Vorbis: | An Open Source lossy, size-compressed audio format |
|
WAV: | A container format, almost always used for lossless, uncompressed, PCM audio. The format is in Microsoft's Little-Endian byte order. |
|
AAC: | AAC is a lossy, size-compressed audio format. Designed to be the successor of the MP3 format. AAC generally achieves better sound quality than MP3 at similar bit rates. AAC's best known use is as the default audio format of Apple's iPhone, iPod, iTunes, and the format used for all iTunes Store audio. AAC is also the standard audio format for Sony's PlayStation 3. |



