Aften: A/52 audio encoder Version SVN (c) 2006-2007 Justin Ruggles, Prakash Punnoor, et al. usage: aften [options] <input.wav> <output.ac3> options: CONSOLE OUTPUT OPTIONS [-h] Print out list of commandline options. [-longhelp] Print out commandline option details. [-version] Print out the build version. [-v #] Verbosity This controls the level of console output to stderr. 0 - Quiet Mode. No output to stderr. 1 - Shows a running average of encoding statistics. This is the default setting. 2 - Shows the statistics for each frame. ENCODING OPTIONS [-threads #] Number of threads Aften can use multiple threads to speed up encoding. By default, Aften uses one thread for each logical CPU your system has, but you can override this value. A value of 0 is the default and indicates that Aften should try to detect the number of CPUs. [-nosimd X] Comma-separated list of SIMD instruction sets not to use Aften will auto-detect available SIMD instruction sets for your CPU, so you shouldn't need to disable sets explicitly - unless for speed or debugging reasons. Available sets are mmx, sse, sse2, sse3 and altivec. No spaces are allowed between the sets and the commas. Example: -nosimd sse2,sse3 [-b #] CBR bitrate in kbps CBR mode is selected by default. This option allows for setting the fixed bitrate. The default bitrate depends on the number of channels (not including LFE). mono = 96 kbps stereo = 192 kbps 3-channel = 256 kbps 4-channel = 384 kbps 5-channel = 448 kbps [-q #] VBR quality A value 0 to 1023 which corresponds to SNR offset, where q=240 equates to an SNR offset of 0. 240 is the default value. This scale will most likely be replaced in the future with a better quality measurement. [-fba #] Fast bit allocation Fast bit allocation is a less-accurate search method for CBR bit allocation. It only narrows down the SNR value to within 16 of the optimal value. The result is lower overall quality, but faster encoding. This may not give the same results each time when using parallel encoding. [-fes #] Fast exponent strategy decision By default, the exponent strategy for each channel in a frame is decided by finding the best choice out of 5 pre-defined sets of strategies. When this fast option is turned on, the same set is always used for every channel in every frame, which leads to generally lower quality but gives a significant speed increase. [-pad #] Start-of-stream padding The AC-3 format uses an overlap/add cycle for encoding each block. By default, Aften pads the delay buffer with a block of silence to avoid inaccurate encoding of the first frame of audio. If this behavior is not wanted, it can be disabled. The pad value can be a 1 (default) to use padding or 0 to not use padding. [-w #] Bandwidth The bandwidth setting corresponds to the high-frequency cutoff. Specifically, it sets the highest frequency bin which is encoded. The AC-3 format uses a 512-point MDCT which gives 256 frequency levels from 0 to 1/2 of the samplerate. The formula to give the number of coded frequency bins from bandwidth setting is: (w * 3) + 73, which gives a range of 73 to 253 I hope to replace this setting with one where the user specifies the actual cutoff frequency rather than the bandwidth code. There are 2 special values, -1 and -2. When -1 is used, Aften automatically selects what it thinks is an appropriate bandwidth. This is the default setting. When -2 is used, a bandwidth is chosen for each frame based on CBR frame size and a target quality of 240. Variable bandwidth cannot be used with VBR mode. [-wmin #] Minimum bandwidth For variable bandwidth mode (-2), this option sets the minimum value for the bandwidth code. This allows the user to avoid a harsh cutoff frequency by sacrificing general audio quality. The default value is 0. [-wmax #] Maximum bandwidth For variable bandwidth mode (-2), this option sets the maximum value for the bandwidth code. This can be used to speed up encoding by using a lower value than 60, which is the default. [-m #] Stereo rematrixing Using stereo rematrixing can increase quality by removing redundant information between the left and right channels. This technique is common in audio encoding, and is sometimes called mid/side encoding. When this setting is turned on, Aften adaptively turns rematrixing on or off for each of 4 frequency bands for each block. When this setting is turned off, rematrixing is not used for any blocks. The default value is 1. [-s #] Block switching The AC-3 format allows for 2 different types of MDCT transformations to translate from time-domain to frequency-domain. The default is a 512-point transform, which gives better frequency resolution. There is also a 256-point transform, which gives better time resolution. The specification gives a suggested method for determining when to use the 256-point transform. When block switching is turned on, Aften uses the spec method for selecting the 256-point MDCT. When it is turned off, only the 512-point MDCT is used, which is faster. Block switching is turned off by default. BITSTREAM INFO METADATA [-cmix #] Center mix level When three front channels are in use, this code indicates the nominal down mix level of the center channel with respect to the left and right channels. 0 = -3.0 dB (default) 1 = -4.5 dB 2 = -6.0 dB [-smix #] Surround mix level If surround channels are in use, this code indicates the nominal down mix level of the surround channels. 0 = -3 dB (default) 1 = -6 dB 2 = 0 [-dsur #] Dolby Surround mode When operating in the two channel mode, this code indicates whether or not the program has been encoded in Dolby Surround. This information is not used by the AC-3 decoder, but may be used by other portions of the audio reproduction equipment. 0 = not indicated (default) 1 = not Dolby surround encoded 2 = Dolby surround encoded DYNAMIC RANGE COMPRESSION AND DIALOG NORMALIZATION [-dynrng #] Dynamic Range Compression profile Dynamic Range Compression allows for the final output dynamic range to be limited without sacrificing quality. The full dynamic range audio is still encoded, but a code is given for each block which tells the decoder to adjust the output volume for that block. The encoder must analyze the input audio to determine the best way to compress the dynamic range based on the loudness and type of input (film, music, speech). 0 = Film Light 1 = Film Standard 2 = Music Light 3 = Music Standard 4 = Speech 5 = None (default) [-dnorm #] Dialog normalization [0 - 31] (default: 31) The dialog normalization value sets the average dialog level. The value is typically constant for a parti...
fullkiller