17. KDD 2011: San Diego, CA, USA
Chid Apté, Joydeep Ghosh, Padhraic Smyth (Eds.):
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011.
ACM 2011, ISBN 978-1-4503-0813-7
Keynote address 1
- Stephen Boyd:
Convex optimization: from embedded real-time to large-scale distributed.
1
Keynote address 2
Keynote address 3
Keynote address 4
Classification
Matrix factorization
Graph analysis
Web user modeling
- Amr Ahmed, Yucheng Low, Mohamed Aly, Vanja Josifovski, Alexander J. Smola:
Scalable distributed inference of dynamic user interests for behavioral targeting.
114-122
- Yucheng Low, Deepak Agarwal, Alexander J. Smola:
Multiple domain user personalization.
123-131
- Deepak Agarwal, Bee-Chung Chen, Pradheep Elango, Xuanhui Wang:
Click shaping to optimize multiple objectives.
132-140
- Aditya Krishna Menon, Krishna Prasad Chitrapura, Sachin Garg, Deepak Agarwal, Nagaraj Kota:
Response prediction using collaborative filtering with hierarchies and side-information.
141-149
User modeling
Online data and streams
- Peng Zhang, Jun Li, Peng Wang, Byron J. Gao, Xingquan Zhu, Li Guo:
Enabling fast prediction for ensemble models on data streams.
177-185
- Josh Attenberg, Foster J. Provost:
Online active inference and learning.
186-194
- Wei Chu, Martin Zinkevich, Lihong Li, Achint Thomas, Belle L. Tseng:
Unbiased online active learning in data streams.
195-203
- Hamed Valizadegan, Rong Jin, Shijun Wang:
Learning to trade off between exploration and exploitation in multiclass bandit prediction.
204-212
Deployed 1
- Kunal Mukerjee, Todd Porter, Sorin Gherman:
Linear scale semantic mining algorithms in Microsoft SQL Server's semantic platform.
213-221
- Yanfang Ye, Tao Li, Shenghuo Zhu, Weiwei Zhuang, Egemen Tas, Umesh Gupta, Melih Abdulhayoglu:
Combining file content and file relations for cloud based malware detection.
222-230
- Ron Bekkerman, Matan Gavish:
High-precision phrase-based document classification on a modern scale.
231-239
- Feng Chen, Jing Dai, Bingsheng Wang, Sambit Sahu, Milind R. Naphade, Chang-Tien Lu:
Activity analysis based on low sample rate smart meters.
240-248
Deployed 2
- Ahmed Metwally, Matt Paduano:
Estimating the number of users behind IP addresses for combating abusive traffic.
249-257
- Xuhui Shao, Lexin Li:
Data-driven multi-touch attribution models.
258-264
- Ying Cui, Ruofei Zhang, Wei Li, Jianchang Mao:
Bid landscape forecasting in online ad exchange marketplace.
265-273
- D. Sculley, Matthew Eric Otey, Michael Pohl, Bridget Spitznagel, John Hainsworth, Yunkai Zhou:
Detecting adversarial advertisements in the wild.
274-282
Discovery
- Li Zheng, Chao Shen, Liang Tang, Tao Li, Steven Luis, Shu-Ching Chen:
Applying data mining techniques to address disaster information management challenges on mobile devices.
283-291
- Chunyu Luo, Hui Xiong, Wenjun Zhou, Yanhong Guo, Guishi Deng:
Enhancing investment decisions in P2P lending: an investor composition perspective.
292-300
- Daniel P. McCloskey, Michael E. Kress, Susan P. Imberman, Igor Kushnir, Susan Briffa-Mirabella:
From market baskets to mole rats: using data mining techniques to analyze RFID data describing laboratory animal behavior.
301-306
- Prasad Gabbur, Sharath Pankanti, Quanfu Fan, Hoang Trinh:
A pattern discovery approach to retail fraud detection.
307-315
Emerging 1
- Jing Yuan, Yu Zheng, Xing Xie, Guangzhong Sun:
Driving with knowledge from the physical world.
316-324
- Rayid Ghani, Mohit Kumar:
Interactive learning for efficiently detecting errors in insurance claims.
325-333
- Amol Ghoting, Prabhanjan Kambadur, Edwin P. D. Pednault, Ramakrishnan Kannan:
NIMBLE: a toolkit for the implementation of parallel data mining and machine learning algorithms on MapReduce.
334-342
- Dean Cerrato, Rosie Jones, Avinash Gupta:
Classification of proxy labeled examples for marketing segment generation.
343-350
- Rakesh Agrawal, Samuel Ieong, Raja Velu:
Ameliorating buyer's remorse.
351-359
Emerging 2
- Debprakash Patnaik, Patrick Butler, Naren Ramakrishnan, Laxmi Parida, Benjamin J. Keller, David A. Hanauer:
Experiences with mining temporal event sequences from electronic medical records: initial successes and some challenges.
360-368
- György J. Simon, Peter W. Li, Clifford R. Jack Jr., Prashanthi Vemuri:
Understanding atrophy trajectories in Alzheimer's disease using association rules on MRI images.
369-376
- Bruno Pradel, Savaneary Sean, Julien Delporte, Sébastien Guérif, Céline Rouveirol, Nicolas Usunier, Françoise Fogelman-Soulié, Frédéric Dufau-Joël:
A case study in a recommender system based on purchase data.
377-385
- Feilong Chen, Supranamaya Ranjan, Pang-Ning Tan:
Detecting bots via incremental LS-SVM learning with dynamic feature adaptation.
386-394
- Hani Neuvirth, Michal Ozery-Flato, Jianying Hu, Jonathan Laserson, Martin S. Kohn, Shahram Ebadollahi, Michal Rosen-Zvi:
Toward personalized care management of patients at risk: the diabetes case study.
395-403
Emerging 3
Text mining 1
Text mining 2
Privacy
Social networks
- Mao Ye, Dong Shou, Wang-Chien Lee, Peifeng Yin, Krzysztof Janowicz:
On the semantic annotation of places in location-based social networks.
520-528
- Michael Mathioudakis, Francesco Bonchi, Carlos Castillo, Aristides Gionis, Antti Ukkonen:
Sparsification of influence networks.
529-537
- Mahashweta Das, Gautam Das, Vagelis Hristidis:
Leveraging collaborative tagging for web item design.
538-546
Theory
Frequent sets
Text mining 3
Unsupervised learning
Graph mining
Scalability
Predictive modeling
Demonstration track
- Yong Ge, Chuanren Liu, Hui Xiong, Jian Chen:
A taxi business intelligence system.
735-738
- Duen Horng Chau, Aniket Kittur, Jason I. Hong, Christos Faloutsos:
Apolo: interactive large graph sensemaking by combining machine learning and visualization.
739-742
- Jian Fan, Ping Luo, Suk Hwan Lim, Sam Liu, Joshi Parag, Jerry Liu:
Article clipper: a system for web article extraction.
743-746
- Robert S. Sinkovits, Pietro Cicotti, Shawn Strande, Mahidhar Tatineni, Paul Rodriguez, Nicole Wolter, Natasha Balac:
Data intensive analysis on the Gordon high performance data and compute system.
747-748
- Jakub Piskorski, Martin Atkinson:
Frontex real-time news event extraction framework.
749-752
- Xin Jin, Chi Wang, Jiebo Luo, Xiao Yu, Jiawei Han:
LikeMiner: a system for mining the power of 'like' in social media networks.
753-756
- Bart Goethals, Sandy Moens, Jilles Vreeken:
MIME: a framework for interactive visual pattern mining.
757-760
- Gustavo E. A. P. A. Batista, Eamonn J. Keogh, Agenor Mafra-Neto, Edgar Rowton:
SIGKDD demo: sensors and software to allow computational entomology, an emerging application of data mining.
761-764
- Cheng-Te Li, Shou-De Lin:
Social flocks: a crowd simulation framework for social network generation, community detection, and collective behavior modeling.
765-768
- Jie Tang, Sen Wu, Bo Gao, Yang Wan:
Topic-level social network search.
769-772
- Harikrishna G. N. Rai, Kishore Jonna, P. Radha Krishna:
Video analytics solution for tracking customer locations in retail shopping malls.
773-776
Industrial practice expo track
- David H. Reiley:
"Which half is wasted?": controlled experiments to measure online-advertising effectiveness.
777
- Mario E. Inchiosa:
Accelerating large-scale data mining using in-database analytics.
778
- Ravi Vijayaraghavan, P. V. Kannan:
Applications of data mining and machine learning in online customer care.
779
- Dan Steinberg, Felipe Fernandez Martinez:
Broad scale predictive modeling and marketing optimization in retail sales.
780
- Paul A. Rejto:
Knowledge discovery and data mining in pharmaceutical cancer research.
781
- Colleen McCue:
Operational security analytics: doing more with less.
782
- Tai Hsu:
Real-time risk control system for CNP (card not present).
783
- David Norton:
The power of analysis and data.
784
- Richard Boire:
The practitioner's viewpoint to data mining: key lessons learned in the trenches and case studies.
785
- John F. Elder IV:
Thriving as a data miner in the real world.
786
Poster session
- Luís Torgo, Orlando Ohashi:
2D-interval predictions for time series.
787-794
- Faris Alqadah, Raj Bhatnagar:
A game theoretic framework for heterogenous information network clustering.
795-804
- Andrew Cotter, Nathan Srebro, Joseph Keshet:
A GPU-tailored approach for training kernelized SVMs.
805-813
- Jiayu Zhou, Lei Yuan, Jun Liu, Jieping Ye:
A multi-task learning formulation for predicting disease progression.
814-822
- György J. Simon, Vipin Kumar, Peter W. Li:
A simple statistical model and association rule filtering for classification.
823-831
- Liangjie Hong, Byron Dom, Siva Gurumurthy, Kostas Tsioutsiouliklis:
A time-dependent topic model for multiple text streams.
832-840
- Cristopher Moore, Xiaoran Yan, Yaojia Zhu, Jean-Baptiste Rouquier, Terran Lane:
Active learning for node classification in assortative and disassortative networks.
841-849
- Chris Mesterharm, Michael J. Pazzani:
Active learning using on-line algorithms.
850-858
- Kanishka Bhaduri, Bryan L. Matthews, Chris Giannella:
Algorithms for speeding up distance-based outlier detection.
859-867
- Hardy Kremer, Philipp Kranen, Timm Jansen, Thomas Seidl, Albert Bifet, Geoff Holmes, Bernhard Pfahringer:
An effective evaluation measure for clustering on evolving data streams.
868-876
- Xueyuan Zhou, Mikhail Belkin, Nathan Srebro:
An iterated graph Laplacian approach for ranking on manifolds.
877-885
- Ruoyi Jiang, Hongliang Fei, Jun Huan:
Anomaly localization for network data streams with graph joint sparse PCA.
886-894
- Radha Chitta, Rong Jin, Timothy C. Havens, Anil K. Jain:
Approximate kernel k-means: solution to large scale kernel clustering.
895-903
- Parisa Rashidi, Diane J. Cook:
Ask me better questions: active learning queries based on rule induction.
904-912
- Yehuda Koren, Edo Liberty, Yoelle Maarek, Roman Sandler:
Automatically tagging email by leveraging other users' folders.
913-921
- Ruoming Jin, Victor E. Lee, Hui Hong:
Axiomatic ranking of network role similarity.
922-930
- Shuai Huang, Jing Li, Jieping Ye, Adam Fleisher, Kewei Chen, Teresa Wu, Eric Reiman:
Brain effective connectivity modeling for Alzheimer's disease by sparse Gaussian Bayesian network.
931-939
- Francisco Pereira, Matthew Botvinick:
Classification of functional magnetic resonance imaging data using informative pattern features.
940-946
- Eric Yi Liu, Zhaojun Zhang, Wei Wang:
Clustering with relative constraints.
947-955
- Huahua Wang, Arindam Banerjee, Daniel Boley:
Common component analysis for multiple covariance matrices.
956-964
- Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, Atte Hinkka:
Compression of weighted graphs.
965-973
- V. G. Vinod Vydiswaran, ChengXiang Zhai, Dan Roth:
Content-driven trust propagation framework.
974-982
- Yong Ge, Qi Liu, Hui Xiong, Alexander Tuzhilin, Jian Chen:
Cost-aware travel tour recommendation.
983-991
- Ruoming Jin, Lin Liu, Charu C. Aggarwal:
Discovering highly reliable subgraphs in uncertain graphs.
992-1000
- Xiaoxiao Shi, Wei Fan, Jianping Zhang, Philip S. Yu:
Discovering shakers from evolving entities via cascading graph inference.
1001-1009
- Wei Liu, Yu Zheng, Sanjay Chawla, Jing Yuan, Xing Xie:
Discovering spatio-temporal causal interactions in traffic data streams.
1010-1018
- Panagiotis Papadimitriou, Hector Garcia-Molina, Prabhakar Krishnamurthy, Randall A. Lewis, David H. Reiley:
Display advertising impact: search lift and social influence.
1019-1027
- Hanghang Tong, Jingrui He, Zhen Wen, Ravi Konuru, Ching-Yung Lin:
Diversified ranking on large graphs: an optimization viewpoint.
1028-1036
- Saurabh Kataria, Krishnan S. Kumar, Rajeev Rastogi, Prithviraj Sen, Srinivasan H. Sengamedu:
Entity disambiguation with hierarchical topic models.
1037-1045
- Salvatore Scellato, Anastasios Noulas, Cecilia Mascolo:
Exploiting place features in link prediction on location-based social networks.
1046-1054
- Kazuo Aoyama, Kazumi Saito, Hiroshi Sawada, Naonori Ueda:
Fast approximate similarity search based on degree-reduced neighborhood graphs.
1055-1063
- Cho-Jui Hsieh, Inderjit S. Dhillon:
Fast coordinate descent methods with variable selection for non-negative matrix factorization.
1064-1072
- Anirban Dasgupta, Ravi Kumar, Tamás Sarlós:
Fast locality-sensitive hashing.
1073-1081
- Eunjoon Cho, Seth A. Myers, Jure Leskovec:
Friendship and mobility: user movement in location-based social networks.
1082-1090
- U. Kang, Hanghang Tong, Jimeng Sun, Ching-Yung Lin, Christos Faloutsos:
GBASE: a scalable and general graph management system.
1091-1099
- Dashun Wang, Dino Pedreschi, Chaoming Song, Fosca Giannotti, Albert-László Barabási:
Human mobility, social ties, and link prediction.
1100-1108
- Gideon Dror, Yehuda Koren, Yoelle Maarek, Idan Szpektor:
I want to answer; who has a question?: Yahoo! Answers recommender system.
1109-1117
- Amit Dhurandhar:
Improving predictions using aggregate information.
1118-1126
- Claudia Plant, Christian Böhm:
INCONCO: interpretable clustering of numerical and categorical objects.
1127-1135
- Sean Gilpin, Ian Davidson:
Incorporating SAT solvers into hierarchical clustering algorithms: an efficient and flexible approach.
1136-1144
- Yan Liu, Pei-yun Hseuh, Rick Lawrence, Steve Meliksetian, Claudia Perlich, Alejandro Veen:
Latent graphical models for quantifying and predicting patent quality.
1145-1153
- Abdullah Mueen, Eamonn J. Keogh, Neal Young:
Logical-shapelets: an expressive primitive for time series classification.
1154-1162
- Puja Das, Arindam Banerjee:
Meta optimization and its application to portfolio selection.
1163-1171
- Nikolaj Tatti, Boris Cule:
Mining closed episodes with simultaneous events.
1172-1180
- Neal Lathia, Licia Capra:
Mining mobility data to minimise travellers' spending on public transport.
1181-1189
- Roberto Trasarti, Fabio Pinelli, Mirco Nanni, Fosca Giannotti:
Mining mobility user profiles for car pooling.
1190-1198
- Zhongang Qi, Ming Yang, Zhongfei (Mark) Zhang, Zhengyou Zhang:
Mining partially annotated images.
1199-1207
- Dan Zhang, Jingrui He, Yan Liu, Luo Si, Richard D. Lawrence:
Multi-view transfer learning with a large margin approach.
1208-1216
- Michael Kwok-Po Ng, Xutao Li, Yunming Ye:
MultiRank: co-ranking for objects and relations in multi-relational data.
1217-1225
- Charu C. Aggarwal, Yan Xie, Philip S. Yu:
On dynamic data-driven selection of sensor streams.
1226-1234
- Pedram Pedarsani, Matthias Grossglauser:
On the privacy of anonymized networks.
1235-1243
- Shan Jiang, Lidong Bing, Bai Sun, Yan Zhang, Wai Lam:
Ontology enhancement and concept granularity learning: keeping yourself current and adaptive.
1244-1252
- Graham Cormode:
Personal privacy vs population privacy: learning to attack anonymization.
1253-1261
- Chih-Hua Tai, Philip S. Yu, De-Nian Yang, Ming-Syan Chen:
Privacy-preserving social network publication against friendship attacks.
1262-1270
- Hongbo Deng, Jiawei Han, Bo Zhao, Yintao Yu, Cindy Xide Lin:
Probabilistic topic models with biased propagation on heterogeneous information networks.
1271-1279
- Xiao Jiang, Chengkai Li, Ping Luo, Min Wang, Yong Yu:
Prominent streak discovery in sequence data.
1280-1288
- Byoungyoung Lee, Jinoh Oh, Hwanjo Yu, Jong Kim:
Protecting location privacy using location semantics.
1289-1297
- Ming Ji, Jiawei Han, Marina Danilevsky:
Ranking-based classification of heterogeneous information networks.
1298-1306
- Ye Chen, Pavel Berkhin, Bo Anderson, Nikhil R. Devanur:
Real-time bidding algorithms for performance-based display ad allocation.
1307-1315
- Aris Gkoulalas-Divanis, Grigorios Loukides:
Revisiting sequential pattern hiding to enhance utility.
1316-1324
- Nilesh N. Dalvi, Ravi Kumar, Ashwin Machanavajjhala, Vibhor Rastogi:
Sampling hidden objects using nearest-neighbor oracles.
1325-1333
- Shrikant Kashyap, Panagiotis Karras:
Scalable kNN search on vertically stored time series.
1334-1342
- Dan Zhang, Yan Liu, Luo Si:
Serendipitous learning: learning beyond the predefined label space.
1343-1351
- Vuk Malbasa, Slobodan Vucetic:
Spatially regularized logistic regression for disease mapping on large moving populations.
1352-1360
- Nagaraj Kota, Deepak Agarwal:
Temporal multi-hierarchy smoothing for estimating rates of rare events.
1361-1369
- Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, Christos Faloutsos:
ThermoCast: a cyber-physical forecasting model for datacenters.
1370-1378
- Chedy Raïssi, Jian Pei:
Towards bounding sequential patterns.
1379-1387
- Yuchen Zhang, Weizhu Chen, Dong Wang, Qiang Yang:
User-click modeling for understanding and predicting search-behavior.
1388-1396
- Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, Ping Li:
User-level sentiment analysis incorporating social networks.
1397-1405
- Sandeepkumar Satpal, Sahely Bhadra, Sundararajan Sellamanickam, Rajeev Rastogi, Prithviraj Sen:
Web information extraction using Markov logic networks.
1406-1414
Last update Thu May 24 03:26:21 2012 CET by the DBLP Team. Data released under the ODC-BY 1.0 license.