Download Continuous and discrete Fourier spectra of aperiodic sequences for sound modeling
The Fourier analysis of aperiodic ordered time structures related with number eight is considered. Recursion relations for the Fourier amplitudes are obtained for a sequence with discrete spectrum. The continuous spectrum of a different type of sequence is also studied . By increasing the number of points in the time axis dynamic spectra can be obtained and used for sound synthesis.
Download Improving intelligibility prediction under informational masking using an auditory saliency model
The reduction of speech intelligibility in noise is usually dominated by energetic masking (EM) and informational masking (IM). Most state-of-the-art objective intelligibility measures (OIM) estimate intelligibility by quantifying EM. Few measures model the effect of IM in detail. In this study, an auditory saliency model, which intends to measure the probability of the sources obtaining auditory attention in a bottom-up process, was integrated into an OIM for improving the performance of intelligibility prediction under IM. While EM is accounted for by the original OIM, IM is assumed to arise from the listener’s attention switching between the target and competing sounds existing in the auditory scene. The performance of the proposed method was evaluated along with three reference OIMs by comparing the model predictions to the listener word recognition rates, for different noise maskers, some of which introduce IM. The results shows that the predictive accuracy of the proposed method is as good as the best reported in the literature. The proposed method, however, provides a physiologically-plausible possibility for both IM and EM modelling.
Download Constrained Pole Optimization for Modal Reverberation
The problem of designing a modal reverberator to match a measured room impulse response is considered. The modal reverberator architecture expresses a room impulse response as a parallel combination of resonant filters, with the pole locations determined by the room resonances and decay rates, and the zeros by the source and listener positions. Our method first estimates the pole positions in a frequency-domain process involving a series of constrained pole position optimizations in overlapping frequency bands. With the pole locations in hand, the zeros are fit to the measured impulse response using least squares. Example optimizations for a mediumsized room show a good match between the measured and modeled room responses.
Download AudioBIFS: The MPEG-4 Standard for Effects Processing
We present a tutorial overview of the AudioBIFS system, part of the Binary Format for Scene Description in the MPEG-4 International Standard. AudioBIFS allows the flexible construction of sound scenes using streaming audio, interactive presentation, 3-D spatialization and environmental auralization, and dynamic download of custom signal-processing routines. MPEG-4 sound scenes are based on a model that is a superset of the model in VRML 2.0, and a comparison between the two models is presented. We discuss the use of SAOL, the MPEG-4 Structured Audio Orchestra Language, for writing downloadable effects. The current status of the standard is described.
Download A Sound Localization based Interface for Real-Time Control of Audio Processing
This paper describes the implementation of an innovative musical interface based on the sound localization capability of a microphone array. Our proposal is to allow a musician to plan and conduct the expressivity of a performance, by controlling in realtime an audio processing module through the spatial movement of a sound source, i.e. voice, traditional musical instruments, sounding mobile devices. The proposed interface is able to locate and track the sound in a two-dimensional space with accuracy, so that the x-y coordinates of the sound source can be used to control the processing parameters. In particular, the paper is focused on the localization and tracking of harmonic sound sources in real moderate reverberant and noisy environment. To this purpose, we designed a system based on adaptive parameterized Generalized Cross-Correlation (GCC) and Phase Transform (PHAT) weighting with Zero-Crossing Rate (ZCR) threshold, a Wiener filter to improve the Signal to Noise Ratio (SNR) and a Kalman filter to make the position estimation more robust and accurate. We developed a Max/MSP external objects to test the system in a real scenario and to validate its usability.
Download Automatic Decomposition of Non-linear Equation Systems in Audio Effect Circuit Simulation
In the digital simulation of non-linear audio effect circuits, the arising non-linear equation system generally poses the main challenge for a computationally cheap implementation. As the computational complexity grows super-linearly with the number of equations, it is beneficial to decompose the equation system into several smaller systems, if possible. In this paper we therefore develop an approach to determine such a decomposition automatically. We limit ourselves to cases where an exact decomposition is possible, however, and do not consider approximate decompositions.
Download Lyapunov Stability Analysis of the Moog Ladder Filter and Dissipativity Aspects in Numerical Solutions
This paper investigates the passivity of the Moog Ladder Filter and its simulation. First, the linearized system is analyzed. Results based on the energy stored in the capacitors lead to a stability domain which is available for time-varying control parameters meanwhile it is sub-optimal for time-invariant ones. A second storage function is proposed, from which the largest stability domain is recovered for a time-invariant Q-parameter. Sufficient conditions for stability are given. Second, the study is adapted to the nonlinear case by introducing a third storage function. Then, a simulation based on the standard bilinear transform is derived and the dissipativity of this numerical version is examined. Simulations show that passivity is not unconditionally guaranteed, but mostly fulfilled, and that typical behaviours of the Moog filter, including self-oscillations, are properly reproduced.
Download Sound synthesis using an allpass filter chain with audio‐rate coefficient modulation
This paper describes a sound synthesis technique that modulates the coefficients of allpass filter chains using audio-rate frequencies. It was found that modulating a single allpass filter section produces a feedback AM–like spectrum, and that its bandwidth is extended and further processed by non-sinusoidal FM when the sections are cascaded. The cascade length parameter provides dynamic bandwidth control to prevent upper range aliasing artifacts, and the amount of spectral content within that band can be controlled using a modulation index parameter. The technique is capable of synthesizing rich and evolving timbres, including those resembling classic virtual analog waveforms. It can also be used as an audio effect with pitch-tracked input sources. Software and sound examples are available at http://www.acoustics.hut.fi/publications/papers/dafx09-cm/
Download A toolkit for experimentation with signal interaction
This paper will describe a toolkit for experimentation with signal interaction techniques, also commonly referred to as cross adaptive processing. The technique allows analyzed features of one audio signal to inform the processing of another. Earlier used mainly for mixing and post production purposes, we now want to use it creatively as an intervention in the musical communication between two performers. The idea stems from Stockhausen’s use of intermodulation in the 1960’s, and as such we might also call the updated technique interprocessing. Our interest in the technique comes as a natural extension to previous research on live processing as an instrumental and performative activity. The automatic control of signal processing routines is related to previous work on adaptive audio effects and automatic mixing. The focus for our investigation and experimentation with the current toolkit will be how this affects the musical communication between performers, and how it changes what they can and will play. The program code for the toolkit is available as a github repository1 under an open source license.
Download On studying auditory distance perception in concert halls with multichannel auralizations
Virtual acoustics and auralizations have been previously used to study the perceptual properties of concert hall acoustics in a descriptive profiling framework. The results have indicated that the apparent auditory distance to the orchestra might play a crucial role in enhancing the listening experience and the appraisal of hall acoustics. However, it is unknown how the acoustics of the hall influence auditory distance perception in such large spaces. Here, we present one step towards studying auditory distance perception in concert halls with virtual acoustics. The aims of this investigation were to evaluate the feasibility of the auralizations and the system to study perceived distances as well as to obtain first evidence on the effects of hall acoustics and the source materials to distance perception. Auralizations were made from measured spatial impulse responses in two concert halls at 14 and 22 meter distances from the center of a calibrated loudspeaker orchestra on stage. Anechoic source materials included symphonic music and pink noise as well as signals produced by concatenating random segments of anechoic instrument recordings. Forty naive test subjects were blindfolded before entering the listening room, where they verbally reported distances to sound sources in the auralizations. Despite the large variance in distance judgments between the individuals, the reported distances were on average in the same range as the actual distances. The results show significant main effects of halls, distances and signals, but also some unexpected effects associated with the presentation order of the stimuli.