Download A Model of Partial Tracks for Tension-Modulated Steel-String Guitar Tones This paper introduces a spectral model for plucked, steel string tones, based on functional models for time-varying fundamental frequency and inharmonicity coefficient. Techniques to evaluate those analytical values at different time indexes are reviewed and commented. A method to evaluate the unknowns of the fundamental frequency and inharmonicity coefficient functions and match the data of a given tone is presented. Frequency tracks can thereafter be deployed and traced for all values of time. Their accuracy is discussed, and applications for the model are suggested.
Download Methods for Separation of Amplitude and Frequency Modulation in Fourier Transformed Signals This paper describes methods for the removal and/or separation of amplitude and frequency modulation of individual components within a Fourier spectrum. The first proposed method has a relatively low cost and works under assumptions about the behaviour of both the local and non-local magnitude and phase of sinusoidal components for these two forms of component nonstationarity. The second method is more expensive and resynthesizes components either in the Fourier or time domain following a parameter estimation stage. Typical applications are the adjustment of expressive parameters in music signals and conditioning of signals prior to cross-synthesis.
Download Fast Signal Reconstruction from Magnitude STFT Spectrogram Based on Spectrogram Consistency The modification of magnitude spectrograms is at the core of many audio signal processing methods, from source separation to sound modification or noise canceling, and reconstructing a natural sounding signal in such situations is thus a very important issue. This article presents recent theoretical and experimental developments on the application to signal reconstruction from a modified magnitude spectrogram of the constraints that an array of complex numbers must verify to be a consistent short-time Fourier transform (STFT) spectrogram, i.e., to be the STFT spectrogram of an actual real-valued signal. We give here further theoretical insights, present several potential variations on our previously introduced algorithm, investigate various techniques to speed up the signal reconstruction process, and present a thorough experimental comparison of the performance of all the considered algorithms.
Download High-Performance Real-Time FIR-Filtering Using Fast Convolution on Graphics Hardware In this paper we examine how graphic hardware can be used for real-time FIR filtering. We implement uniformly-partitioned fast convolution in the frequency-domain and evaluate its performance on a NVIDIA GTX 285 graphics card. Motivated by audio rendering for virtual reality, our focus lies on large-scale realtime filtering with a multitude of channels, long impulse responses and low latencies. Graphics hardware has already been used for audio signal processing — including FIR and IIR filtering with respect to offline and real-time processing. However, the combination of GPU computing and real-time conditions leads to a number of challenges that have not been reviewed in detail. The new contribution of this paper is an implementation and detailled analysis of a frequency-domain fast convolution method on GPUs. We discuss specific problems that emerge under real-time conditions. Our method allows to achieve an outstanding real-time filtering performance. In this work, we do not only regard a timeinvariant filtering, but also time-varying filtering, where filters are exchanged during runtime. Furthermore, we examine the opportunities of distributed computation — using CPU and GPU — in order to maximize the performance. Finally, we identify bottlenecks and explain their impact on filter exchange latencies and update rates.
Download Self-Organised Sounds with a Tremolo Oscillator Tremolo is usually regarded as belonging to the domain of note embellishments. Rapid tremolo, taken into the audio range, is an interesting synthesis technique which is related to FM and granular synthesis. We present a tremolo oscillator, capable of a wide range of sonorities, and illustrate some of its capabilities in applications such as feature-based synthesis and sonification. A reference implementation in Csound is given. The tremolo oscillator is then put into a feedback system, where its output is subject to feature extraction, and the extracted features in turn are mapped to its control parameters. Chaotic orbits in this feedback system guarantee continuous variation, in contrast to the trivial periodic patterns that are easily obtained.
Download Digital Emulation of Distortion Effects by Wave and Phase Shaping Methods This paper will consider wave (amplitude) and phase signal shaping techniques for the digital emulation of distortion effect processing. We examine how to determine the Wave- and Phaseshaping functions with harmonic amplitude and phase data. Three distortion effects units are used to provide test data. The action of the Wave- and Phase- shaping functions derived for these effects is demonstrated with the assistance of a superresolution frequency-domain analysis technique.
Download Chebyshev Model and Synchronized Swept Sine Method in Nonlinear Audio Effect Modeling A new method for the identification of nonlinear systems, based on an input exponential swept sine signal has been proposed by Farina ten years ago. This method has been recently modified in purpose of nonlinear model estimation using a synchronized swept sine signal. It allows a robust and fast one-path analysis and identification of the unknown nonlinear system under test. In this paper this modified method is applied with Chebyshev polynomial decomposition. The combination of the Synchronized Swept Sine Method and Chebyshev polynomials leads to a nonlinear model consisting of several parallel branches, each branch containing a nonlinear Chebyshev polynomial following by a linear filter. The method is tested on an overdrive effect pedal to simulate an analog nonlinear effect in digital domain.
Download Bibliometric Study of the DAFx Proceedings 1998 - 2009 In this paper we present a bibliometric study of the Digital Audio Effects (DAFx) conference proceedings from 1998 to 2009. Using the online DAFx proceedings, we constructed a DAFx database (LaTeX) to study its bibliometric statistics in terms of research topics, growth of literature, authorship distribution, citation patterns, and frequency distribution of scientific productivity. Results showed that the DAFx literature (with quasi-linear accumulative growth) now consists of 722 contributions (including key notes, papers and posters) from 767 unique authors, from which we identified the 20 top DAFx contributors. Using Google Scholar, we identified that the top 10 most cited DAFx papers (between 43 to 65 times) are in majority (8/10) dealing with sound and music analysis (e.g. extraction of sinusoids, musical genre classification, perceived intensity of music, and musical note onset detection). This study also confirmed that the DAFx literature conforms to the Lokta’s law (n=2.0771 and C=0.6336) at 0.01 level of significance using the Kolmogorov-Smirnov test (KS-test) of goodnessof-fit. The DAFx database will serve as the basis for an Author Cocitation Analysis (ACA) and to create a DAFx conferences archive DVD.
Download Estimation and Modeling of Pinna-Related Transfer Functions This paper considers the problem of modeling pinna-related transfer functions (PRTFs) for 3-D sound rendering. Following a structural modus operandi, we present an algorithm for the decomposition of PRTFs into ear resonances and frequency notches due to reflections over pinna cavities. Such an approach allows to control the evolution of each physical phenomenon separately through the design of two distinct filter blocks during PRTF synthesis. The resulting model is suitable for future integration into a structural head-related transfer function model, and for parametrization over anthropometrical measurements of a wide range of subjects.
Download Spatial Audio Object Coding with Enhanced Audio Object Separation Spatial sound reproduction on a multi-channel loudspeaker setup indicate a consistent trend in today’s audio playback systems. Digital surround sound significantly improves the realism of the spatial sound experience, but also results in a drastic increase in required audio data rate. Spatial Audio Coding (SAC) technology provides means for efficient storage and transmission of multi-channel signals by a downmix signal and associated parametric side information describing the spatial sound image. More recently, SAC has been extended with an object-based concept termed Spatial Audio Object Coding (SAOC) enabling efficient coding and interactive spatial rendering of multiple individual audio objects at the playback side. Due to the underlying parametric coding approach, object level manipulations may affect the produced perceptual sound scene quality, and using extreme object attenuation or boosting may result in unacceptably degraded audio quality. The paper describes how regular SAOC processing is advanced to ensure high quality sound reproduction even in demanding remix applications.