All Stories

  1. Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement
  2. Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking
  3. Separation of the aortic and pulmonary components of the second heart sound via alternating optimization
  4. Sound Event Detection: A tutorial
  5. Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation
  6. CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification
  7. Sparse Analysis Model Based Dictionary Learning for Signal Declipping
  8. Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification
  9. Learning With Out-of-Distribution Data for Audio Classification
  10. Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis
  11. PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
  12. Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization
  13. Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data
  14. Sparse Recovery and Dictionary Learning From Nonlinear Compressive Measurements
  15. Weakly Labelled AudioSet Tagging With Attention Neural Networks
  16. Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
  17. Information-Theoretic Approaches to Neural Network Learning
  18. Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
  19. Acoustic Event Detection from Weakly Labeled Data Using Auditory Salience
  20. Acoustic Scene Generation with Conditional Samplernn
  21. Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes
  22. Generalisation in Environmental Sound Classification: The ‘Making Sense of Sounds’ Data Set and Challenge
  23. Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering
  24. Sound Event Detection and Time–Frequency Segmentation from Weakly Labelled Data
  25. Musical Source Separation: An Introduction
  26. Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy
  27. Sound Event Localization and Detection Using CRNN on Pairs of Microphones
  28. Predicting the perceived level of reverberation using machine learning
  29. A Hierarchical Latent Mixture Model for Polyphonic Music Analysis
  30. Raw Multi-Channel Audio Source Separation using Multi- Resolution Convolutional Auto-Encoders
  31. A Contextual Study of Semantic Speech Editing in Radio Production
  32. A Demonstration of Hierarchical Structure Usage in Expressive Timing Analysis by Model Selection Tests
  33. PaperClip: A Digital Pen Interface for Semantic Speech Editing in Radio Production
  34. Inexact Proximal Operators for <tex>$\ell_{p}$</tex>-Quasinorm Minimization
  35. Orthogonality-Regularized Masked NMF for Learning on Weakly Labeled Audio Data
  36. A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data
  37. Audio Set Classification with Attention Model: A Probabilistic Perspective
  38. BSS Eval or Peass? Predicting the Perception of Singing-Voice Separation
  39. Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network
  40. Synthesis of Images by Two-Stage Generative Adversarial Networks
  41. Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
  42. Malicious User Detection Based on Low-Rank Matrix Completion in Wideband Spectrum Sensing
  43. Computational Analysis of Sound Scenes and Events
  44. Consistent Dictionary Learning for Signal Declipping
  45. Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks
  46. Latent Variable Analysis and Signal Separation
  47. Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation
  48. Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback
  49. Single channel audio source separation using convolutional denoising autoencoders
  50. Graph-based clustering for identifying region of interest in eye tracker data analysis
  51. Binaural and log-power spectra features with deep neural networks for speech-noise separation
  52. Approaches to Complex Sound Scene Analysis
  53. Future Perspective
  54. Introduction to Sound Scene and Event Analysis
  55. Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks
  56. Using deep neural networks to estimate tongue movements from speech face motion
  57. Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging
  58. Learning the Mapping Function from Voltage Amplitudes to Sensor Positions in 3D-EMA Using Deep Neural Networks
  59. Automatic music transcription using low rank non-negative matrix decomposition
  60. Joint detection and classification convolutional neural network on weakly labelled bird audio detection
  61. Masked non-negative matrix factorization for eire detection using weakly labeled data
  62. Multivariate iterative hard thresholding for sparse decomposition with flexible sparsity patterns
  63. Polyphonic Sound Event Tracking Using Linear Dynamical Systems
  64. Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
  65. Convolutional gated recurrent neural network incorporating spatial features for audio tagging
  66. A greedy algorithm with learned statistics for sparse signal reconstruction
  67. A joint detection-classification model for audio tagging of weakly labelled data
  68. Assessment of musical noise using localization of isolated peaks in time-frequency domain
  69. Fast tagging of natural sounds using marginal co-regularization
  70. Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
  71. Psychophysical Evaluation of Audio Source Separation Methods
  72. Automatic Environmental Sound Recognition: Performance Versus Computational Cost
  73. Combining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks
  74. Evaluation of audio source separation models using hypothesis-driven non-parametric statistical methods
  75. Wideband Spectrum Sensing on Real-Time Signals at Sub-Nyquist Sampling Rates in Single and Cooperative Multiple Nodes
  76. Detection of overlapping acoustic events using a temporally-constrained probabilistic model
  77. Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription
  78. Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning
  79. The Clustering of Expressive Timing Within a Phrase in Classical Piano Performances by Gaussian Mixture Models
  80. Chime-home: A dataset for sound source recognition in a domestic environment
  81. Detection and Classification of Acoustic Scenes and Events
  82. Acoustic Scene Classification: Classifying environments from the sounds they produce
  83. Event-based Multitrack Alignment using a Probabilistic Framework
  84. A dynamic programming variant of non-negative matrix deconvolution for the transcription of struck string instruments
  85. Non-negative matrix factorisation incorporating greedy hellinger sparse coding applied to polyphonic music transcription
  86. Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network
  87. Efficient compressive spectrum sensing algorithm for M2M devices
  88. Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain
  89. Learning Incoherent Subspaces: Classification via Incoherent Dictionary Learning
  90. Large‐scale analysis of frequency modulation in birdsong data bases
  91. Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning
  92. Accounting for phase cancellations in non-negative matrix factorization using weighted distances
  93. Improving instrument recognition in polyphonic music through system integration
  94. Polyphonic piano transcription using non-negative Matrix Factorisation with group sparsity
  95. Score-Informed Source Separation for Musical Audio Recordings: An overview
  96. Best Practices for Scientific Computing
  97. Big Data for Musicology
  98. Dictionary learning via projected maximal exploration
  99. Learning overcomplete dictionaries with ℓ0-sparse Non-negative Matrix Factorisation
  100. Low-rank matrix completion based malicious user detection in cooperative spectrum sensing
  101. Detection and classification of acoustic scenes and events: An IEEE AASP challenge
  102. Multichannel HR-NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
  103. Learning incoherent subspaces for classification via supervised iterative projections and rotations
  104. Structured sparsity using backwards elimination for Automatic Music Transcription
  105. On Theorem 10 in “On Polar Polytopes and the Recovery of Sparse Representations” [Sep 07 3188-3195]
  106. Hearing the shape of a room
  107. Synchronizing Sequencing Software to a Live Drummer
  108. Automatic Music Transcription using row weighted decompositions
  109. Behavior of greedy sparse representation algorithms on nested supports
  110. Improved multiple birdsong tracking with distribution derivative method and Markov renewal process clustering
  111. Recognition of harmonic sounds in polyphonic audio using a missing feature approach
  112. Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis
  113. Learning Incoherent Dictionaries for Sparse Approximation Using Iterative Projections and Rotations
  114. The Serendiptichord: Reflections on the Collaborative Design Process between Artist and Researcher
  115. Predictive Information in Gaussian Processes with Application to Music Analysis
  116. Using Oracle Analysis for Decomposition-Based Automatic Music Transcription
  117. A robust method for S1/S2 heart sounds detection without ecg reference based on music beat tracking
  118. Denoising and segmentation of the second heart sound using matching pursuit
  119. Cognitive music modelling: An information dynamics approach
  120. Analysis-based sparse reconstruction with synthesis-based solvers
  121. Audio Inpainting
  122. INK-SVD: Learning incoherent dictionaries for sparse representations
  123. Instrumentation-based music similarity using sparse representations
  124. Sound Software: Towards software reuse in audio and music research
  125. Structured sparsity for automatic music transcription
  126. A measure of statistical complexity based on predictive information with application to finite spin systems
  127. Dictionary Learning with Large Step Gradient Descent for Sparse Representations
  128. Group Polytope Faces Pursuit for Recovery of Block-Sparse Signals
  129. Performance Following: Real-Time Prediction of Musical Sequences Without a Score
  130. Reliability-Informed Beat Tracking of Musical Signals
  131. Learning Timbre Analogies from Unlabelled Data by Multivariate Tree Regression
  132. On the disjointess of sources in music using different time-frequency representations
  133. Onset Event Decoding Exploiting the Rhythmic Structure of Polyphonic Music
  134. Fast Dictionary Learning for Sparse Representations of Speech Signals
  135. Separating sources from sequentially acquired mixtures of heart signals
  136. A constrained matching pursuit approach to audio declipping
  137. Dictionary learning of convolved signals
  138. Sound Source Separation
  139. Measuring the Performance of Beat Tracking Algorithms Using a Beat Error Histogram
  140. Delayed Decision-making in Real-time Beatbox Percussion Classification
  141. Sparse Representations in Audio and Music: From Coding to Source Separation
  142. An L1 criterion for dictionary learning by subspace identification
  143. A Multichannel Spatial Compressed Sensing Approach for Direction of Arrival Estimation
  144. Gradient Polytope Faces Pursuit for large scale sparse recovery problems
  145. Non-negative mixtures
  146. Note onset detection using rhythmic structure
  147. Performance following: Tracking a performance without a score
  148. SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
  149. Evaluation of live human–computer music-making: Quantitative and qualitative approaches
  150. Towards a musical beat emphasis function
  151. Information dynamics: patterns of expectation and surprise in the perception of music
  152. Sparse reconstruction for compressed sensing using Stagewise Polytope Faces Pursuit
  153. Fast Multidimensional Entropy Estimation by $k$-d Partitioning
  154. Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation
  155. INFORMATION DYNAMICS AND THE PERCEPTION OF TEMPORAL STRUCTURE
  156. Using phase linearity in frequency-domain ICA to tackle the permutation problem
  157. Estimating Phase Linearity in the Frequency-Domain ICA Demixing Matrix
  158. Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation
  159. Efficient Bayesian inference for harmonic models via adaptive posterior factorization
  160. Audio analysis using sparse representations.
  161. An adaptive stereo basis method for convolutive blind audio source separation
  162. Speech Separation Using an Adaptive Sparse Dictionary Algorithm
  163. An adaptive orthogonal sparsifying transform for speech signals
  164. Oracle estimation of adaptive cosine packet transforms for underdetermined audio source separation
  165. Theorems on Positive Data: On the Uniqueness of NMF
  166. On Polar Polytopes and the Recovery of Sparse Representations
  167. Audio source separation with a signal-adaptive local cosine transform
  168. Oracle estimators for the benchmarking of source separation algorithms
  169. Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models
  170. Context-Dependent Beat Tracking of Musical Audio
  171. B-Keeper
  172. Blind Source Separation using Space–Time Independent Component Analysis
  173. Flag Manifolds for Subspace ICA Problems
  174. Geometry and Manifolds for Independent Component Analysis
  175. Independent Component Analysis and Signal Separation
  176. On the Use of Entropy for Beat Tracking Evaluation
  177. Real-time beat-synchronous audio effects
  178. Information theory and sensory perception
  179. Fast Factorization-Based Inference for Bayesian Harmonic Models
  180. Sparse representations of polyphonic music
  181. Recovery of Sparse Representations by Polytope Faces Pursuit
  182. Riemannian Optimization Method on the Flag Manifold for Independent Subspace Analysis
  183. Riemannian Optimization Method on Generalized Flag Manifolds for Complex and Subspace ICA
  184. Single-Channel Mixture Decomposition Using Bayesian Harmonic Models
  185. Sparse Coding for Convolutive Blind Audio Source Separation
  186. Unsupervised Analysis of Polyphonic Music by Sparse Coding
  187. Geometrical methods for non-negative ICA: Manifolds, Lie groups and toral subalgebras
  188. Beat tracking with a two state model [music applications]
  189. Blind Separation of Positive Sources by Globally Convergent Gradient Search
  190. A "nonnegative PCA" algorithm for independent component analysis
  191. Application of Geometric Dependency Analysis to the Separation of Convolved Mixtures
  192. Lie Group Methods for Optimization with Orthogonality Constraints
  193. Optimization Using Fourier Expansion over a Geodesic for Non-negative ICA
  194. Algorithms for nonnegative independent component analysis
  195. Automatic Music Transcription and Audio Source Separation
  196. Conditions for nonnegative independent component analysis
  197. Do cortical maps adapt to optimize information density?
  198. Do cortical maps adapt to optimize information density?
  199. Maximizing information about a noisy signal with a single non-linear neuron
  200. Designing Neural Networks using a Genetic Rule-based System
  201. Unsupervised neural network learning procedures for feature extraction and classification
  202. Information processing in negative feedback neural networks
  203. Information processing in negative feedback neural networks
  204. Lyapunov functions for convergence of principal component algorithms
  205. Analysis of an Unsupervised Indirect Feedback Network
  206. Generation and adaptation of neural networks by evolutionary techniques (GANNET)
  207. Approximating Optimal Information Transmission using Local Hebbian Algorithms in a Double Feedback Loop
  208. Efficient information transfer and anti-Hebbian neural networks
  209. Information Theory and Neural Networks
  210. Direct Approaches to Improving the Robustness of Multilayer Neural Networks
  211. The effect of receptor signal-to-noise levels on optimal filtering in a sensory system
  212. Sensory adaptation: an information-theoretic viewpoint
  213. Musical audio analysis using sparse representations
  214. Audio Source Separation using Sparse Representations
  215. Probabilistic Modeling Paradigms for Audio Source Separation
  216. Natural Conjugate Gradient on Complex Flag Manifolds for Complex Independent Subspace Analysis
  217. Dictionary Learning for L1-Exact Sparse Coding
  218. The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
  219. A prototype system for object coding of musical audio
  220. Identification of dental bacteria using statistical and neural approaches
  221. Information and Density and Cortical Magnification Factors
  222. Communications and neural networks: theory and practice