Download Leveraging Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling Guitar tablature transcription (GTT) aims at automatically generating symbolic representations from real solo guitar performances. Due to its applications in education and musicology, GTT has gained traction in recent years. However, GTT robustness has been limited due to the small size of available datasets. Researchers have recently used synthetic data that simulates guitar performances using pre-recorded or computer-generated tones, allowing for scalable and automatic data generation. The present study complements these efforts by demonstrating that GTT robustness can be improved by including synthetic training data created using recordings of real guitar tones played with different audio effects. We evaluate our approach on a new evaluation dataset with professional solo guitar performances that we composed and collected, featuring a wide array of tones, chords, and scales.
Download The Influence of Small Variations in a Simplified Guitar Amplifier Model A strongly simplified guitar amplifier model, consisting of four stages, is presented. The exponential sweep technique is used to measure the frequency dependent harmonic spectra. The influence of small variations of the system parameters on the harmonic components is analyzed. The differences of the spectra are explained and visualized.
Download Morphing techniques for enhanced scat singing In jazz, scat singing is a phonetic improvisation that imitates instrumental sounds. In this paper, we propose a system that aims to transform singing voice into real instrument sounds, extending the possibilities for scat singers. Analysis algorithms in the spectral domain extract voice parameters, which drive the resulting instrument sound. A small database contains real instrument samples that have been spectrally analyzed offline. Two different prototypes are introduced, producing sounds of a trumpet and a bass guitar respectively.
Download Visualization of Signals and Algorithms in Kronos Kronos is a visual-oriented programming language and a compiler aimed at musical signal processing tasks. Its distinctive feature is the support for functional programming idioms like closures and higher order functions in the context of high performance real time DSP. This paper examines the visual aspect of the system. The programming user interface is discussed, along with a scheme for building custom data visualization algorithms inside the system.
Download Compositional Use Of Digital Audio Effects This article tracks the composers’ desire for increased control over continuous musical variables through examples for music from the last few centuries. Examples are given on the development of notation and orchestration. A special focus is provided on electroacoustic music as a natural continuation of this development, and typical types of timbral development and structural discourse are brought forward in an attempt to explain which parameters composers of electroacoustic music are considering in their work.
Download What you hear is what you see: Audio quality from Image Quality Metrics In this study, we investigate the feasibility of utilizing stateof-the-art perceptual image metrics for evaluating audio signals by representing them as spectrograms. The encouraging outcome of the proposed approach is based on the similarity between the neural mechanisms in the auditory and visual pathways. Furthermore, we customise one of the metrics which has a psychoacoustically plausible architecture to account for the peculiarities of sound signals. We evaluate the effectiveness of our proposed metric and several baseline metrics using a music dataset, with promising results in terms of the correlation between the metrics and the perceived quality of audio as rated by human evaluators.
Download High-Performance Real-Time FIR-Filtering Using Fast Convolution on Graphics Hardware In this paper we examine how graphic hardware can be used for real-time FIR filtering. We implement uniformly-partitioned fast convolution in the frequency-domain and evaluate its performance on a NVIDIA GTX 285 graphics card. Motivated by audio rendering for virtual reality, our focus lies on large-scale realtime filtering with a multitude of channels, long impulse responses and low latencies. Graphics hardware has already been used for audio signal processing — including FIR and IIR filtering with respect to offline and real-time processing. However, the combination of GPU computing and real-time conditions leads to a number of challenges that have not been reviewed in detail. The new contribution of this paper is an implementation and detailled analysis of a frequency-domain fast convolution method on GPUs. We discuss specific problems that emerge under real-time conditions. Our method allows to achieve an outstanding real-time filtering performance. In this work, we do not only regard a timeinvariant filtering, but also time-varying filtering, where filters are exchanged during runtime. Furthermore, we examine the opportunities of distributed computation — using CPU and GPU — in order to maximize the performance. Finally, we identify bottlenecks and explain their impact on filter exchange latencies and update rates.
Download Trans-synthesis System for Polyphonic Musical Recordings of Bowed-String Instruments A system that tries to analyze polyphonic musical recordings of bowed-string instruments, extract synthesis parameters of individual instrument and then re-synthesize is proposed. In the analysis part, multiple F0s estimation and partials tracking are performed based on modified WGCDV (weighted greatest common divisor and vote) method and high-order HMM. Then, dynamic time warping algorithm is employed to align the above results with the score to improve the accuracy of the extracted parameters. In the re-synthesis part, simple additive synthesis is employed. Here, one can experiment on changing timbres, pitches and so on or adding vibrato or other effects on the same piece of music.
Download Separation Of Speech Signal From Complex Auditory Scenes The hearing system, even in front of complex auditory scenes and in unfavourable conditions, is able to separate and recognize auditory events accurately. A great deal of effort has gone into the understanding of how, after having captured the acoustical data, the human auditory system processes them. The aim of this work is the digital implementation of the decomposition of a complex sound in separate parts as it would appear to a listener. This operation is called signal separation. In this work, the separation of speech signal from complex auditory scenes has been studied and an experimentation of the techniques that address this problem has been done.
Download Audio Signal Processing and Object-oriented Systems Object-oriented programming (OOP) has been for many years now one of the most important programming paradigms used in a variety of applications. Digital audio signal processing can benefit largely from this approach for systems development. In this paper a number of approaches to using object-orientation in audio processing systems are reviewed. Existing systems of audio processing are introduced and discussed in detail. The paper also draws attention to the different OOP techniques enabled and supported by these systems. Comparative code and tutorial examples are included, providing an insight into the development of signal processing applications using objects.