Analyze Menu

From Audacity Development Manual
Revision as of 02:02, 18 December 2007 by Windinthew (talk | contribs) (Find Clipping...: added another sentence (better explanation of Threshold welcome, if anyone has one)...)
Jump to: navigation, search

Analyze contains tools for finding out about the characteristics of your audio, or labeling key features. Plug-ins that accept input but produce no output will also be placed in the Analyze menu.


ToDo Sort the table out
If we don't want table and cell borders with Wikitable class (and if we still use tables for layout, we probably don't) it looks if borders have to be turned off in Common.css if this behaves like other Wikis, but http://www.audacityteam.org/manual/index.php?title=MediaWiki:Common.css seems to have no content? Update - it looks as if the code I added has worked if you call the table "prettytable". The actual code was only an experiment to see if anything happened. I'm assuming "wikitable" (the code is I think the same as for "prettytable") won't work for a while because of caching, but I don't know the active directory name to issue "action=purge".
Unless this table is merely a summary reference point, it isn't going to work - look at the text in plot spectrum alone (which is needed). Or is Plot Spectrum an "advanced" menu item (like some of the advanced effects) which has its own page? - Gale

James: Summary Reference. Yes. Track Pop-Down Menu has the right level of detail for a reference. Having more detail linked from it is icing on the cake. For example, the track pop down menu description doesn't show a screenshot of the dB view. It doesn't need to have it, but we could add it (as a link) if we want and have time to. The Plot Spectrum item is complex enough to merit a page, I think.

Gale: There'd be a stronger case for a link to Spectrum and Pitch views I think. Very undecided at present otherwise. Trying for the Track Pop-Down Menu solution may well be time consuming if we have to accommodate screen readers - it clearly is not acceptable now from that point of view, and it may possibly not work on the Effect Menu page simply because of the length. We can carry on adding text but I am not sure we can decide (as you said) until Effect Menu is finished. Also if we don't have the "Pop-Down" solution, then as regards this page, possibly it just is not long enough to justify a separate page for Plot Spectrum, i.e. maybe it would look better here.


Menu Description
Contents of the Analyze menu

Plot Spectrum text

Find Clipping text


Beat Finder text

Regular interval labels text

Silence finder text

Vamp Plug-ins will appear in this menu too, if they are installed.


Plot Spectrum...

Frequency Analysis window of a music track

This takes the selected audio (which is a set of sound pressure values at points in time) and converts it to a graph of frequencies against amplitudes. This is done using a complicated piece of maths known as a Fast Fourier Transform or FFT. This gives a value for each narrow band of frequencies that represents how much of those frequencies is present. All the values are then interpolated to create the graph. Note that currently only the first 23.8 seconds of selected audio can be analyzed.

The following drop-down boxes let you configure the way the plot is calculated:

Algorithm

Determines what type of processing is done on the audio data.

  • Spectrum: (default) - Plots the FFT of the data as described above.
  • Autocorrelation: These options measure to what extent the sound repeats itself. This is done by taking two copies of the audio, and moving one forward by one sample. The two copies are then multiplied together, and all the values added up. This is repeated for two samples difference and so on, up to the number of samples in the size option. This gives a small result if the waveform is random (for example, noise) and a large result if it is repetitive (like a musical note). By looking at the peaks in the plot, the key frequencies present can be determined even if there is a lot of noise.
  • Cepstrum:

Size

This controls how many frequency divisions are used for the spectrum, or how many samples are used for the autocorrelation. In the Spectrum, a larger number gives more accurate frequencies (narrow bands) but averages the result over a longer period of time (because more samples are needed for the calculation). In the Autocorrelation, a large size looks for repeating patterns over a larger range of time offsets, and so will detect lower frequency patterns.

Function

Selects which mathematical function is used to pre-process the data. The basic forms of the FFT and autocorrelation algorithm require infinitely long sections of audio to work on, and so take infinitely long to complete. Hence the available audio must be pre-processed in such a way that the finite length of the audio has minimum effect. The available functions are known as "windows":

  • Rectangular: is the simplest - it just cuts off the given block of samples with a sharp cut, and so leaves a sharp click at each end of the data. This means the results are often poor with a lot of random frequencies in them.
  • Hamming, Hanning, Bartlett: do a smooth fade in and out of the audio data, and so give cleaner, more accurate results. Can someone explain what the differences are? I've only done Hamming!
  • Blackman, Blackman-Harris:
  • Welch:
  • Gaussian:

Axis

When using the Spectrum, the frequencies can be displayed on a linear scale (default, which gives equal width to each increment on the scale) or on a logarithmic scale.


Find Clipping...

Looks for and displays runs of "clipped" samples in a Label Track, as a screen-reader accessible alternative to View > Show Clipping. The number of samples for the "Start threshold" and "Stop threshold" is configurable, and set to three by default. The "threshold" is the number of contiguous clipped samples that must be present before they will be labeled as a run. The Start threshold therefore means the number of clipped samples that must be present to left of a non-clipped sample before it will be labeled as an individual run, and Stop threshold the number that must be present to right of a non-clipped sample before it will be labeled as an individual run. Note that unlike "Show Clipping", "Find Clipping" is not automatic, so after any editing that affects volume, or after import or recording of a new track, Find Clipping must be run again.

Each individual run of clipped samples is marked in the format "number of number". For example, "11 of 11" means there are 11 samples in this run of clipped samples and all of them are clipped, while "9 of 11" means only nine samples in the run of 11 are clipped. Note that in the case of a stereo track, each channel will have its own label, so if at one point both channels are clipped, two labels will appear above each other.

Beat Finder...

Beat Finder attempts to place labels at beats which are much louder than the surrounding audio. It's a fairly rough and ready tool, and won't necessarily work well on a typical modern pop music track with compressed dynamic range. If you don't get enough beats detected, try reducing the "Threshold Percentage" setting.

Regular interval labels...

This places labels in a long track so as to divide it into smaller, equally sized segments. For example, you may want to break up a long file into smaller audio files for distribution on the internet. The labels can contain the name of the separate files, then File > Export Multiple... will export all the files in one process based on the label points. You can either choose the number of labels to be created, or the interval between them.

Silence Finder...

This also places labels in a long track to divide it up, but does so at areas of silence according to the level and duration of silence you specify. This is very useful to break up a long recording such as from an LP or cassette into its individual tracks. The labels can contain the name of each track, and then File > Export Multiple... will export audio files in one process corresponding to each track of the recording. If too many silences are detected, increase the silence level and duration; if too few are detected, reduce the level and duration.