Dan Ellis
List of publications from the DBLP Bibliography Server

| 2012 | ||
|---|---|---|
| j24 | Jon Gudnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor: Data-driven voice source waveform analysis and synthesis. Speech Communication 54(2): 199-211 (2012) | |
| c68 | Byung Suk Lee, Daniel P. W. Ellis: Noise Robust Pitch Tracking by Subband Autocorrelation Classification. INTERSPEECH 2012 | |
| c67 | Thierry Bertin-Mahieux, Daniel P. W. Ellis: Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude. ISMIR 2012: 241-246 | |
| c66 | Kai Su, Mor Naaman, Avadhut Gurjar, Mohsin Patel, Daniel P. W. Ellis: Making a scene: alignment of complete sets of clips based on pairwise audio match. ICMR 2012: 26 | |
| c65 | Gerald Friedland, Daniel P. W. Ellis, Florian Metze: AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis. ACM Multimedia 2012: 1513-1514 | |
| c64 | Brian McFee, Thierry Bertin-Mahieux, Daniel P. W. Ellis, Gert R. G. Lanckriet: The million song dataset challenge. WWW (Companion Volume) 2012: 909-916 | |
| 2011 | ||
| j23 | Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard, Shigeki Sagayama: Introduction to the Special Issue on Music Signal Processing. J. Sel. Topics Signal Processing 5(6): 1085-1087 (2011) | |
| j22 | Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard: Signal Processing for Music Analysis. J. Sel. Topics Signal Processing 5(6): 1088-1110 (2011) | |
| j21 | Graham Grindlay, Daniel P. W. Ellis: Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments. J. Sel. Topics Signal Processing 5(6): 1159-1169 (2011) | |
| j20 | Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis: Combining localization cues and source model constraints for binaural source separation. Speech Communication 53(5): 606-621 (2011) | |
| c63 | Thierry Bertin-Mahieux, Graham Grindlay, Ron J. Weiss, Daniel P. W. Ellis: Evaluating music sequence models through missing data. ICASSP 2011: 177-180 | |
| c62 | Christos Vezyrtzis, Aaron E. Klein, Dan Ellis, Yannis P. Tsividis: Direct processing of MPEG audio using companding and BFP techniques. ICASSP 2011: 361-364 | |
| c61 | Courtenay V. Cotton, Daniel P. W. Ellis, Alexander C. Loui: Soundtrack classification by transient events. ICASSP 2011: 473-476 | |
| c60 | Daniel P. W. Ellis, Xiaohong Zeng, Josh H. McDermott: Classifying soundtracks with audio texture features. ICASSP 2011: 5880-5883 | |
| c59 | Fadi Biadsy, Julia Hirschberg, Daniel P. W. Ellis: Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors. INTERSPEECH 2011: 745-748 | |
| c58 | Thierry Bertin-Mahieux, Daniel P. W. Ellis, Brian Whitman, Paul Lamere: The Million Song Dataset. ISMIR 2011: 591-596 | |
| c57 | Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel P. W. Ellis, Alexander C. Loui: Consumer video understanding: a benchmark database and an evaluation of human and machine performance. ICMR 2011: 29 | |
| c56 | Courtenay V. Cotton, Daniel P. W. Ellis: Spectral vs. spectro-temporal features for acoustic event detection. WASPAA 2011: 69-72 | |
| c55 | Thierry Bertin-Mahieux, Daniel P. W. Ellis: Large-scale cover song recognition using hashed chroma landmarks. WASPAA 2011: 117-120 | |
| c54 | ||
| 2010 | ||
| j19 | Ron J. Weiss, Daniel P. W. Ellis: Speech separation using speaker-adapted eigenvoice speech models. Computer Speech & Language 24(1): 16-29 (2010) | |
| j18 | Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis: Model-Based Expectation-Maximization Source Separation and Localization. IEEE Transactions on Audio, Speech & Language Processing 18(2): 382-394 (2010) | |
| j17 | Keansub Lee, Daniel P. W. Ellis: Audio-Based Semantic Concept Classification for Consumer Video. IEEE Transactions on Audio, Speech & Language Processing 18(6): 1406-1416 (2010) | |
| j16 | Michael I. Mandel, S. Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis: Evaluating Source Separation Algorithms With Reverberant Speech. IEEE Transactions on Audio, Speech & Language Processing 18(7): 1872-1883 (2010) | |
| j15 | Wei Jiang, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui: Audio-visual atoms for generic video concept classification. TOMCCAP 6(3) (2010) | |
| c53 | Suman V. Ravuri, Daniel P. W. Ellis: Cover song detection: From high scores to general classification. ICASSP 2010: 65-68 | |
| c52 | Keansub Lee, Daniel P. W. Ellis, Alexander C. Loui: Detecting local semantic concepts in environmental sounds using Markov model based clustering. ICASSP 2010: 2278-2281 | |
| c51 | Courtenay V. Cotton, Daniel P. W. Ellis: Audio fingerprinting to identify multiple videos of an event. ICASSP 2010: 2386-2389 | |
| c50 | Graham Grindlay, Daniel P. W. Ellis: A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription. ISMIR 2010: 21-26 | |
| c49 | Thierry Bertin-Mahieux, Ron J. Weiss, Daniel P. W. Ellis: Clustering Beat-Chroma Patterns in a Large Music Database. ISMIR 2010: 111-116 | |
| c48 | Yu-Gang Jiang, Xiaohong Zeng, Guangnan Ye, Dan Ellis, Shih-Fu Chang, Subhabrata Bhattacharya, Mubarak Shah: Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching. TRECVID 2010 | |
| 2009 | ||
| j14 | Jesper Højvang Jensen, Mads Græsbøll Christensen, Daniel P. W. Ellis, Søren Holdt Jensen: Quantitative Analysis of a Common Audio Similarity Measure. IEEE Transactions on Audio, Speech & Language Processing 17(4): 693-703 (2009) | |
| c47 | Ron J. Weiss, Daniel P. W. Ellis: A variational EM algorithm for learning eigenvoice parameters in mixed signals. ICASSP 2009: 113-116 | |
| c46 | Douglas Eck, Dan Ellis, Philippe Hamel: Workshop summary: Sparse methods for music audio. ICML 2009: 171 | |
| c45 | Adrian Weller, Daniel P. W. Ellis, Tony Jebara: Structured Prediction Models for Chord Transcription of Music Audio. ICMLA 2009: 590-595 | |
| c44 | Jon Gudnason, Mark R. P. Thomas, Patrick A. Naylor, Daniel P. W. Ellis: Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling. INTERSPEECH 2009: 108-111 | |
| c43 | Wei Jiang, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui: Short-term audio-visual atoms for generic video concept classification. ACM Multimedia 2009: 5-14 | |
| c42 | Christine Smit, Daniel P. W. Ellis: Guided harmonic sinusoid estimation in a multi-pitch environment. WASPAA 2009: 41-44 | |
| c41 | Johanna Devaney, Michael I. Mandel, Daniel P. W. Ellis: Improving MIDI-audio alignment with acoustic features. WASPAA 2009: 45-48 | |
| c40 | Graham Grindlay, Daniel P. W. Ellis: Multi-voice polyphonic music transcription using eigeninstruments. WASPAA 2009: 53-56 | |
| c39 | Michael I. Mandel, Daniel P. W. Ellis: The Ideal Interaural Parameter Mask: A bound on binaural separation systems. WASPAA 2009: 85-88 | |
| c38 | Courtenay V. Cotton, Daniel P. W. Ellis: Finding similar acoustic events using matching pursuit and locality-sensitive hashing. WASPAA 2009: 125-128 | |
| 2008 | ||
| c37 | Keansub Lee, Daniel P. W. Ellis: Detecting music in ambient audio by long-window autocorrelation. ICASSP 2008: 9-12 | |
| c36 | Daniel P. W. Ellis, Courtenay V. Cotton, Michael I. Mandel: Cross-correlation of beat-synchronous representations for music similarity. ICASSP 2008: 57-60 | |
| c35 | Jesper Højvang Jensen, Mads Græsbøll Christensen, Daniel P. W. Ellis, Søren Holdt Jensen: A tempo-insensitive distance measure for cover song identification based on chroma features. ICASSP 2008: 2209-2212 | |
| c34 | Suman V. Ravuri, Daniel P. W. Ellis: Stylization of pitch with syllable-based linear segments. ICASSP 2008: 3985-3988 | |
| c33 | Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis: Source separation based on binaural cues and source model constraints. INTERSPEECH 2008: 419-422 | |
| c32 | Michael I. Mandel, Daniel P. W. Ellis: Multiple-Instance Learning for Music Information Retrieval. ISMIR 2008: 577-582 | |
| 2007 | ||
| j13 | Graham E. Poliner, Daniel P. W. Ellis: A Discriminative Model for Polyphonic Piano Transcription. EURASIP J. Adv. Sig. Proc. 2007 (2007) | |
| j12 | Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly: Using Broad Phonetic Group Experts for Improved Speech Recognition. IEEE Transactions on Audio, Speech & Language Processing 15(3): 803-812 (2007) | |
| j11 | Graham E. Poliner, Daniel P. W. Ellis, A. F. Ehmann, Emilia Gómez, S. Streich, Beesuan Ong: Melody Transcription From Music Audio: Approaches and Evaluation. IEEE Transactions on Audio, Speech & Language Processing 15(4): 1247-1256 (2007) | |
| j10 | Marios Athineos, Daniel P. W. Ellis: Autoregressive Modeling of Temporal Envelopes. IEEE Transactions on Signal Processing 55(11): 5237-5245 (2007) | |
| c31 | James P. Ogle, Daniel P. W. Ellis: Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings. ICASSP (1) 2007: 233-236 | |
| c30 | Jesper Højvang Jensen, Daniel P. W. Ellis, Mads Græsbøll Christensen, Søren Holdt Jensen: Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs. ISMIR 2007: 107-108 | |
| c29 | ||
| c28 | Michael I. Mandel, Daniel P. W. Ellis: A Web-Based Game for Collecting Music Metadata. ISMIR 2007: 365-366 | |
| c27 | Alexander C. Loui, Jiebo Luo, Shih-Fu Chang, Dan Ellis, Wei Jiang, Lyndon S. Kennedy, Keansub Lee, Akira Yanagawa: Kodak's consumer video benchmark data set: concept definition and annotation. Multimedia Information Retrieval 2007: 245-254 | |
| c26 | Shih-Fu Chang, Dan Ellis, Wei Jiang, Keansub Lee, Akira Yanagawa, Alexander C. Loui, Jiebo Luo: Large-scale multimodal semantic concept detection for consumer video. Multimedia Information Retrieval 2007: 255-264 | |
| c25 | Aiden R. Doherty, Alan F. Smeaton, Keansub Lee, Daniel P. W. Ellis: Multimodal Segmentation of Lifelog Data. RIAO 2007 | |
| 2006 | ||
| j9 | ||
| j8 | Daniel P. W. Ellis, Keansub Lee: Accessing Minimal-Impact Personal Audio Archives. IEEE MultiMedia 13(4): 30-38 (2006) | |
| j7 | Daniel P. W. Ellis, Graham E. Poliner: Classification-based melody transcription. Machine Learning 65(2-3): 439-456 (2006) | |
| j6 | Michael I. Mandel, Graham E. Poliner, Daniel P. W. Ellis: Support vector machine active learning for music retrieval. Multimedia Syst. 12(1): 3-13 (2006) | |
| c24 | Keansub Lee, Daniel P. W. Ellis: Voice activity detection in personal audio recordings using autocorrelogram compensation. INTERSPEECH 2006 | |
| c23 | Michael I. Mandel, Daniel P. W. Ellis, Tony Jebara: An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments. NIPS 2006: 953-960 | |
| 2005 | ||
| j5 | J. P. Barker, Martin P. Cooke, Daniel P. W. Ellis: Decoding speech in the presence of other sources. Speech Communication 45(1): 5-25 (2005) | |
| c22 | Graham E. Poliner, Daniel P. W. Ellis: A Classification Approach to Melody Transcription. ISMIR 2005: 161-166 | |
| c21 | Michael I. Mandel, Dan Ellis: Song-Level Features and Support Vector Machines for Music Classification. ISMIR 2005: 594-599 | |
| 2004 | ||
| j4 | Martin P. Cooke, Daniel P. W. Ellis: Introduction to the special issue on the recognition and organization of real-world sound. Speech Communication 43(4): 273-274 (2004) | |
| c20 | Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis: LP-TRAP: linear predictive temporal patterns. INTERSPEECH 2004 | |
| c19 | Dan Ellis, John Arroyo: Eigenrhythms: Drum pattern basis sets for classification and generation. ISMIR 2004 | |
| c18 | ||
| 2003 | ||
| c17 | Adam Berenzweig, Daniel P. W. Ellis, Steve Lawrence: Anchor space for classification and similarity measurement of music. ICME 2003: 29-32 | |
| c16 | Manuel J. Reyes Gomez, Daniel P. W. Ellis: Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling. ICME 2003: 73-76 | |
| c15 | Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly: Using mutual information to design class-specific phone recognizers. INTERSPEECH 2003 | |
| c14 | Adam Berenzweig, Beth Logan, Daniel P. W. Ellis, Brian Whitman: A large-scale evaluation of acoustic and subjective music similarity measures. ISMIR 2003 | |
| c13 | Alexander Sheh, Daniel P. W. Ellis: Chord segmentation and recognition using EM-trained hidden Markov models. ISMIR 2003 | |
| c12 | Robert J. Turetsky, Daniel P. W. Ellis: Ground-truth transcriptions of real music from force-aligned MIDI syntheses. ISMIR 2003 | |
| c11 | ||
| 2002 | ||
| j3 | Anthony J. Robinson, G. D. Cook, Daniel P. W. Ellis, Eric Fosler-Lussier, Steve Renals, D. A. G. Williams: Connectionist speech recognition of Broadcast News. Speech Communication 37(1-2): 27-45 (2002) | |
| c10 | Manuel J. Reyes Gomez, Daniel P. W. Ellis: Error visualization for tandem acoustic modeling on the Aurora task. ICASSP 2002: 4176 | |
| c9 | Daniel P. W. Ellis, Brian Whitman, Adam Berenzweig, Steve Lawrence: The Quest for Ground Truth in Musical Artist Similarity. ISMIR 2002 | |
| 2001 | ||
| j2 | Martin Cooke, Daniel P. W. Ellis: The auditory organization of speech and other sources in listeners and computational models. Speech Communication 35(3-4): 141-177 (2001) | |
| c8 | Daniel P. W. Ellis, Manuel J. Reyes Gomez: Investigations into tandem acoustic modeling for the Aurora task. INTERSPEECH 2001: 189-192 | |
| 2000 | ||
| c7 | Daniel P. W. Ellis, Jeff A. Bilmes: Using mutual information to design feature combinations. INTERSPEECH 2000: 79-82 | |
| c6 | Jon Barker, Martin Cooke, Daniel P. W. Ellis: Decoding speech in the presence of other sound sources. INTERSPEECH 2000: 270-273 | |
| c5 | Javier Ferreiros López, Daniel P. W. Ellis: Using acoustic condition clustering to improve acoustic change detection on broadcast news. INTERSPEECH 2000: 568-571 | |
| 1999 | ||
| j1 | Daniel P. W. Ellis: Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures. Speech Communication 27(3-4): 281-298 (1999) | |
| c4 | Adam Janin, Dan Ellis, Nelson Morgan: Multi-stream speech recognition: ready for prime time? EUROSPEECH 1999 | |
| c3 | Gethin Williams, Daniel P. W. Ellis: Speech/music discrimination based on posterior probability features. EUROSPEECH 1999 | |
| c2 | Dave Abberley, Steve Renals, Dan Ellis, Anthony J. Robinson: The THISL SDR System At TREC-8. TREC 1999 | |
| 1994 | ||
| c1 | Dan Ellis: Barefoot multimedia, or, All is not what it seems, Moriarty. Interactive Multimedia in University Education 1994: 151-154 | |
Last update Wed May 22 22:50:08 2013 CET by the DBLP Team

Data released under the ODC-BY 1.0 license