Download Real-Time Black-Box Modelling With Recurrent Neural Networks
This paper proposes to use a recurrent neural network for black-box modelling of nonlinear audio systems, such as tube amplifiers and distortion pedals. As a recurrent unit structure, we test both Long Short-Term Memory and a Gated Recurrent Unit. We compare the proposed neural network with a WaveNet-style deep neural network, which has been suggested previously for tube amplifier modelling. The neural networks are trained with several minutes of guitar and bass recordings, which have been passed through the devices to be modelled. A real-time audio plugin implementing the proposed networks has been developed in the JUCE framework. It is shown that the recurrent neural networks achieve similar accuracy to the WaveNet model, while requiring significantly less processing power to run. The Long Short-Term Memory recurrent unit is also found to outperform the Gated Recurrent Unit overall. The proposed neural network is an important step forward in computationally efficient yet accurate emulation of tube amplifiers and distortion pedals.
Download A Minimal Passive Model of the Operational Amplifier: Application to Sallen-Key Analog Filters
This papers stems from the fact that, whereas there are passive models of transistors and tubes, a minimal passive model of the operational amplifier does not seem to exist. A new behavioural model is presented that is memoryless, fully described by its interaction ports, with a minimal number of equations, for which a passive power balance can be defined. The proposed model handles saturation, asymmetric power supply, and can be used with nonideal voltage references. To illustrate the model in audio applications, the non-inverting voltage amplifier and a saturating Sallen-Key lowpass filter are considered.
Download Towards Inverse Virtual Analog Modeling
Several digital signal processing approaches, generally referred to as Virtual Analog (VA) modeling, are currently under development for the software emulation of analog audio circuitry. The main purpose of VA modeling is to faithfully reproduce the behavior of real-world audio gear, e.g., distortion effects, synthesizers or amplifiers, using efficient algorithms. In this paper, however, we provide a preliminary discussion about how VA modeling can be exploited to infer the input signal of an analog audio system, given the output signal and the parameters of the circuit. In particular, we show how an inversion theorem known in circuit theory, and based on nullors, can be used for this purpose. As recent advances in Wave Digital Filter (WDF) theory allow us to implement circuits with nullors in a systematic fashion, WDFs prove to be useful tools for inverse VA modeling. WDF realizations of a nonlinear audio system and its inverse are presented as an example of application.
Download Data Augmentation for Instrument Classification Robust to Audio Effects
Reusing recorded sounds (sampling) is a key component in Electronic Music Production (EMP), which has been present since its early days and is at the core of genres like hip-hop or jungle. Commercial and non-commercial services allow users to obtain collections of sounds (sample packs) to reuse in their compositions. Automatic classification of one-shot instrumental sounds allows automatically categorising the sounds contained in these collections, allowing easier navigation and better characterisation. Automatic instrument classification has mostly targeted the classification of unprocessed isolated instrumental sounds or detecting predominant instruments in mixed music tracks. For this classification to be useful in audio databases for EMP, it has to be robust to the audio effects applied to unprocessed sounds. In this paper we evaluate how a state of the art model trained with a large dataset of one-shot instrumental sounds performs when classifying instruments processed with audio effects. In order to evaluate the robustness of the model, we use data augmentation with audio effects and evaluate how each effect influences the classification accuracy.
Download A general-purpose deep learning approach to model time-varying audio effects
Audio processors whose parameters are modified periodically over time are often referred as time-varying or modulation based audio effects. Most existing methods for modeling these type of effect units are often optimized to a very specific circuit and cannot be efficiently generalized to other time-varying effects. Based on convolutional and recurrent neural networks, we propose a deep learning architecture for generic black-box modeling of audio processors with long-term memory. We explore the capabilities of deep neural networks to learn such long temporal dependencies and we show the network modeling various linear and nonlinear, time-varying and time-invariant audio effects. In order to measure the performance of the model, we propose an objective metric based on the psychoacoustics of modulation frequency perception. We also analyze what the model is actually learning and how the given task is accomplished.
Download Modelling of nonlinear state-space systems using a deep neural network
In this paper we present a new method for the pseudo black-box modelling of general continuous-time state-space systems using a discrete-time state-space system with an embedded deep neural network. Examples are given of how this method can be applied to a number of common nonlinear electronic circuits used in music technology, namely two kinds of diode-based guitar distortion circuits and the lowpass filter of the Korg MS-20 synthesizer.
Download Digital Grey Box Model of the Uni-Vibe Effects Pedal
This paper presents a digital grey box model of a late 1960s era Shin-ei Uni-Vibe(r) 1 analog effects foot pedal. As an early phase shifter, it achieved wide success in popular music as a unique musical effect, noteworthy for its pulsating and throbbing modulation sounds. The Uni-Vibe is an early series all-pass phaser effect, where each first-order section is a discrete component phase splitter (no operational amplifiers). The dynamic sweeping movement of the effect arises from a single LFO-driven incandescent lamp opto-coupled to the light dependent resistors (LDRs) of each stage. The proposed method combines digital circuit models with measured LDR characteristics for the four phase shift stages of an original Uni-Vibe unit, resulting in an efficient emulation that preserves the character of the Uni-Vibe. In modeling this iconic effect, we also aim to offer some historical and technical insight into the exact nature of its unique sound.
Download Analysis and Emulation of Early Digitally-Controlled Oscillators Based on the Walsh-Hadamard Transform
Early analog synthesizer designs are very popular nowadays, and the discrete-time emulation of voltage-controlled oscillator (VCO) circuits is covered by a large number of virtual analog (VA) textbooks, papers and tutorials. One of the issues of well-known VCOs is their tuning instability and sensitivity to environmental conditions. For this reason, digitally-controlled oscillators were later introduced to provide stable tuning. Up to now, such designs have gained much less attention in the music processing literature. In this paper, we examine one of such designs, which is based on the Walsh-Hadamard transform. The concept was employed in the ARP Pro Soloist and in the Welson Syntex, among others. Some historical background is provided, along with a discussion on the principle, the actual implementation and a band-limited virtual analog derivation.