Daniel P. W. Ellis
Dan Ellis
2010 – today
- 2012
[j24]Jon Gudnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor: Data-driven voice source waveform analysis and synthesis. Speech Communication 54(2): 199-211 (2012)
[c68]Byung Suk Lee, Daniel P. W. Ellis: Noise Robust Pitch Tracking by Subband Autocorrelation Classification. INTERSPEECH 2012
[c67]Thierry Bertin-Mahieux, Daniel P. W. Ellis: Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude. ISMIR 2012: 241-246
[c66]Kai Su, Mor Naaman, Avadhut Gurjar, Mohsin Patel, Daniel P. W. Ellis: Making a scene: alignment of complete sets of clips based on pairwise audio match. ICMR 2012: 26
[c65]Gerald Friedland, Daniel P. W. Ellis, Florian Metze: AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis. ACM Multimedia 2012: 1513-1514
[c64]Brian McFee, Thierry Bertin-Mahieux, Daniel P. W. Ellis, Gert R. G. Lanckriet: The million song dataset challenge. WWW (Companion Volume) 2012: 909-916
- 2011
[j23]Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard, Shigeki Sagayama: Introduction to the Special Issue on Music Signal Processing. J. Sel. Topics Signal Processing 5(6): 1085-1087 (2011)
[j22]Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard: Signal Processing for Music Analysis. J. Sel. Topics Signal Processing 5(6): 1088-1110 (2011)
[j21]Graham Grindlay, Daniel P. W. Ellis: Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments. J. Sel. Topics Signal Processing 5(6): 1159-1169 (2011)
[j20]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis: Combining localization cues and source model constraints for binaural source separation. Speech Communication 53(5): 606-621 (2011)
[c63]Thierry Bertin-Mahieux, Graham Grindlay, Ron J. Weiss, Daniel P. W. Ellis: Evaluating music sequence models through missing data. ICASSP 2011: 177-180
[c62]Christos Vezyrtzis, Aaron E. Klein, Dan Ellis, Yannis P. Tsividis: Direct processing of mpeg audio using companding and BFP techniques. ICASSP 2011: 361-364
[c61]Courtenay V. Cotton, Daniel P. W. Ellis, Alexander C. Loui: Soundtrack classification by transient events. ICASSP 2011: 473-476
[c60]Daniel P. W. Ellis, Xiaohong Zeng, Josh H. McDermott: Classifying soundtracks with audio texture features. ICASSP 2011: 5880-5883
[c59]Fadi Biadsy, Julia Hirschberg, Daniel P. W. Ellis: Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors. INTERSPEECH 2011: 745-748
[c58]Thierry Bertin-Mahieux, Daniel P. W. Ellis, Brian Whitman, Paul Lamere: The Million Song Dataset. ISMIR 2011: 591-596
[c57]Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel P. W. Ellis, Alexander C. Loui: Consumer video understanding: a benchmark database and an evaluation of human and machine performance. ICMR 2011: 29
[c56]Courtenay V. Cotton, Daniel P. W. Ellis: Spectral vs. spectro-temporal features for acoustic event detection. WASPAA 2011: 69-72
[c55]Thierry Bertin-Mahieux, Daniel P. W. Ellis: Large-scale cover song recognition using hashed chroma landmarks. WASPAA 2011: 117-120
[c54]
- 2010
[j19]Ron J. Weiss, Daniel P. W. Ellis: Speech separation using speaker-adapted eigenvoice speech models. Computer Speech & Language 24(1): 16-29 (2010)
[j18]Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis: Model-Based Expectation-Maximization Source Separation and Localization. IEEE Transactions on Audio, Speech & Language Processing 18(2): 382-394 (2010)
[j17]Keansub Lee, Daniel P. W. Ellis: Audio-Based Semantic Concept Classification for Consumer Video. IEEE Transactions on Audio, Speech & Language Processing 18(6): 1406-1416 (2010)
[j16]Michael I. Mandel, S. Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis: Evaluating Source Separation Algorithms With Reverberant Speech. IEEE Transactions on Audio, Speech & Language Processing 18(7): 1872-1883 (2010)
[j15]Wei Jiang, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui: Audio-visual atoms for generic video concept classification. TOMCCAP 6(3) (2010)
[c53]Suman V. Ravuri, Daniel P. W. Ellis: Cover song detection: From high scores to general classification. ICASSP 2010: 65-68
[c52]Keansub Lee, Daniel P. W. Ellis, Alexander C. Loui: Detecting local semantic concepts in environmental sounds using Markov model based clustering. ICASSP 2010: 2278-2281
[c51]Courtenay V. Cotton, Daniel P. W. Ellis: Audio fingerprinting to identify multiple videos of an event. ICASSP 2010: 2386-2389
[c50]Graham Grindlay, Daniel P. W. Ellis: A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription. ISMIR 2010: 21-26
[c49]Thierry Bertin-Mahieux, Ron J. Weiss, Daniel P. W. Ellis: Clustering Beat-Chroma Patterns in a Large Music Database. ISMIR 2010: 111-116
[c48]Yu-Gang Jiang, Xiaohong Zeng, Guangnan Ye, Dan Ellis, Shih-Fu Chang, Subhabrata Bhattacharya, Mubarak Shah: Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching. TRECVID 2010
2000 – 2009
- 2009
[j14]Jesper Højvang Jensen, Mads Græsbøll Christensen, Daniel P. W. Ellis, Søren Holdt Jensen: Quantitative Analysis of a Common Audio Similarity Measure. IEEE Transactions on Audio, Speech & Language Processing 17(4): 693-703 (2009)
[c47]Ron J. Weiss, Daniel P. W. Ellis: A variational EM algorithm for learning eigenvoice parameters in mixed signals. ICASSP 2009: 113-116
[c46]Douglas Eck, Dan Ellis, Philippe Hamel: Workshop summary: Sparse methods for music audio. ICML 2009: 171
[c45]Adrian Weller, Daniel P. W. Ellis, Tony Jebara: Structured Prediction Models for Chord Transcription of Music Audio. ICMLA 2009: 590-595
[c44]Jon Gudnason, Mark R. P. Thomas, Patrick A. Naylor, Daniel P. W. Ellis: Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling. INTERSPEECH 2009: 108-111
[c43]Wei Jiang, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui: Short-term audio-visual atoms for generic video concept classification. ACM Multimedia 2009: 5-14
[c42]Christine Smit, Daniel P. W. Ellis: Guided harmonic sinusoid estimation in a multi-pitch environment. WASPAA 2009: 41-44
[c41]Johanna Devaney, Michael I. Mandel, Daniel P. W. Ellis: Improving MIDI-audio alignment with acoustic features. WASPAA 2009: 45-48
[c40]Graham Grindlay, Daniel P. W. Ellis: Multi-voice polyphonic music transcription using eigeninstruments. WASPAA 2009: 53-56
[c39]Michael I. Mandel, Daniel P. W. Ellis: The Ideal Interaural Parameter Mask: A bound on binaural separation systems. WASPAA 2009: 85-88
[c38]Courtenay V. Cotton, Daniel P. W. Ellis: Finding similar acoustic events using matching pursuit and locality-sensitive hashing. WASPAA 2009: 125-128
- 2008
[c37]Keansub Lee, Daniel P. W. Ellis: Detecting music in ambient audio by long-window autocorrelation. ICASSP 2008: 9-12
[c36]Daniel P. W. Ellis, Courtenay V. Cotton, Michael I. Mandel: Cross-correlation of beat-synchronous representations for music similarity. ICASSP 2008: 57-60
[c35]Jesper Højvang Jensen, Mads Græsbøll Christensen, Daniel P. W. Ellis, Søren Holdt Jensen: A tempo-insensitive distance measure for cover song identification based on chroma features. ICASSP 2008: 2209-2212
[c34]Suman V. Ravuri, Daniel P. W. Ellis: Stylization of pitch with syllable-based linear segments. ICASSP 2008: 3985-3988
[c33]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis: Source separation based on binaural cues and source model constraints. INTERSPEECH 2008: 419-422
[c32]Michael I. Mandel, Daniel P. W. Ellis: Multiple-Instance Learning for Music Information Retrieval. ISMIR 2008: 577-582
- 2007
[j13]Graham E. Poliner, Daniel P. W. Ellis: A Discriminative Model for Polyphonic Piano Transcription. EURASIP J. Adv. Sig. Proc. 2007 (2007)
[j12]Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly: Using Broad Phonetic Group Experts for Improved Speech Recognition. IEEE Transactions on Audio, Speech & Language Processing 15(3): 803-812 (2007)
[j11]Graham E. Poliner, Daniel P. W. Ellis, A. F. Ehmann, Emilia Gómez, S. Streich, Beesuan Ong: Melody Transcription From Music Audio: Approaches and Evaluation. IEEE Transactions on Audio, Speech & Language Processing 15(4): 1247-1256 (2007)
[j10]Marios Athineos, Daniel P. W. Ellis: Autoregressive Modeling of Temporal Envelopes. IEEE Transactions on Signal Processing 55(11): 5237-5245 (2007)
[c31]James P. Ogle, Daniel P. W. Ellis: Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings. ICASSP (1) 2007: 233-236
[c30]Jesper Højvang Jensen, Daniel P. W. Ellis, Mads Græsbøll Christensen, Søren Holdt Jensen: Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs. ISMIR 2007: 107-108
[c29]
[c28]Michael I. Mandel, Daniel P. W. Ellis: A Web-Based Game for Collecting Music Metadata. ISMIR 2007: 365-366
[c27]Alexander C. Loui, Jiebo Luo, Shih-Fu Chang, Dan Ellis, Wei Jiang, Lyndon S. Kennedy, Keansub Lee, Akira Yanagawa: Kodak's consumer video benchmark data set: concept definition and annotation. Multimedia Information Retrieval 2007: 245-254
[c26]Shih-Fu Chang, Dan Ellis, Wei Jiang, Keansub Lee, Akira Yanagawa, Alexander C. Loui, Jiebo Luo: Large-scale multimodal semantic concept detection for consumer video. Multimedia Information Retrieval 2007: 255-264
[c25]Aiden R. Doherty, Alan F. Smeaton, Keansub Lee, Daniel P. W. Ellis: Multimodal Segmentation of Lifelog Data. RIAO 2007
- 2006
[j9]
[j8]Daniel P. W. Ellis, Keansub Lee: Accessing Minimal-Impact Personal Audio Archives. IEEE MultiMedia 13(4): 30-38 (2006)
[j7]Daniel P. W. Ellis, Graham E. Poliner: Classification-based melody transcription. Machine Learning 65(2-3): 439-456 (2006)
[j6]Michael I. Mandel, Graham E. Poliner, Daniel P. W. Ellis: Support vector machine active learning for music retrieval. Multimedia Syst. 12(1): 3-13 (2006)
[c24]Keansub Lee, Daniel P. W. Ellis: Voice activity detection in personal audio recordings using autocorrelogram compensation. INTERSPEECH 2006
[c23]Michael I. Mandel, Daniel P. W. Ellis, Tony Jebara: An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments. NIPS 2006: 953-960
- 2005
[j5]J. P. Barker, Martin P. Cooke, Daniel P. W. Ellis: Decoding speech in the presence of other sources. Speech Communication 45(1): 5-25 (2005)
[c22]Graham E. Poliner, Daniel P. W. Ellis: A Classification Approach to Melody Transcription. ISMIR 2005: 161-166
[c21]Michael I. Mandel, Dan Ellis: Song-Level Features and Support Vector Machines for Music Classification. ISMIR 2005: 594-599
- 2004
[j4]Martin P. Cooke, Daniel P. W. Ellis: Introduction to the special issue on the recognition and organization of real-world sound. Speech Communication 43(4): 273-274 (2004)
[c20]Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis: LP-TRAP: linear predictive temporal patterns. INTERSPEECH 2004
[c19]Dan Ellis, John Arroyo: Eigenrhythms: Drum pattern basis sets for classification and generation. ISMIR 2004
[c18]
- 2003
[c17]Adam Berenzweig, Daniel P. W. Ellis, Steve Lawrence: Anchor space for classification and similarity measurement of music. ICME 2003: 29-32
[c16]Manuel J. Reyes Gomez, Daniel P. W. Ellis: Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling. ICME 2003: 73-76
[c15]Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly: Using mutual information to design class-specific phone recognizers. INTERSPEECH 2003
[c14]Adam Berenzweig, Beth Logan, Daniel P. W. Ellis, Brian Whitman: A large-scale evaluation of acoustic and subjective music similarity measures. ISMIR 2003
[c13]Alexander Sheh, Daniel P. W. Ellis: Chord segmentation and recognition using EM-trained hidden Markov models. ISMIR 2003
[c12]Robert J. Turetsky, Daniel P. W. Ellis: Ground-truth transcriptions of real music from force-aligned MIDI syntheses. ISMIR 2003
[c11]
- 2002
[j3]Anthony J. Robinson, G. D. Cook, Daniel P. W. Ellis, Eric Fosler-Lussier, Steve Renals, D. A. G. Williams: Connectionist speech recognition of Broadcast News. Speech Communication 37(1-2): 27-45 (2002)
[c10]Manuel J. Reyes Gomez, Daniel P. W. Ellis: Error visualization for tandem acoustic modeling on the Aurora task. ICASSP 2002: 4176
[c9]Daniel P. W. Ellis, Brian Whitman, Adam Berenzweig, Steve Lawrence: The Quest for Ground Truth in Musical Artist Similarity. ISMIR 2002
- 2001
[j2]Martin Cooke, Daniel P. W. Ellis: The auditory organization of speech and other sources in listeners and computational models. Speech Communication 35(3-4): 141-177 (2001)
[c8]Daniel P. W. Ellis, Manuel J. Reyes Gomez: Investigations into tandem acoustic modeling for the Aurora task. INTERSPEECH 2001: 189-192
- 2000
[c7]Daniel P. W. Ellis, Jeff A. Bilmes: Using mutual information to design feature combinations. INTERSPEECH 2000: 79-82
[c6]Jon Barker, Martin Cooke, Daniel P. W. Ellis: Decoding speech in the presence of other sound sources. INTERSPEECH 2000: 270-273
[c5]Javier Ferreiros López, Daniel P. W. Ellis: Using acoustic condition clustering to improve acoustic change detection on broadcast news. INTERSPEECH 2000: 568-571
1990 – 1999
- 1999
[j1]Daniel P. W. Ellis: Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures. Speech Communication 27(3-4): 281-298 (1999)
[c4]Adam Janin, Dan Ellis, Nelson Morgan: Multi-stream speech recognition: ready for prime time? EUROSPEECH 1999
[c3]Gethin Williams, Daniel P. W. Ellis: Speech/music discrimination based on posterior probability features. EUROSPEECH 1999
[c2]Dave Abberley, Steve Renals, Dan Ellis, Anthony J. Robinson: The THISL SDR System At TREC-8. TREC 1999
- 1994
[c1]Dan Ellis: Barefoot multimedia, or, All is not what it seems, Moriarty. Interactive Multimedia in University Education 1994: 151-154
data released under the ODC-BY 1.0 license. See also our legal information page
last updated on 2013-01-30 21:03 CET by the dblp team



