| 2013 | ||
|---|---|---|
| i3 | Khashayar Rohanimanesh, Sridhar Mahadevan: Decision-Theoretic Planning with Concurrent Temporally Extended Actions. CoRR abs/1301.2307 (2013) | |
| 2012 | ||
| c62 | Hoa Trong Vu, Clifton Carey, Sridhar Mahadevan: Manifold Warping: Manifold Alignment over Time. AAAI 2012 | |
| c61 | ||
| c60 | ||
| i2 | ||
| i1 | ||
| 2011 | ||
| c59 | Chang Wang, Sridhar Mahadevan: Heterogeneous Domain Adaptation Using Manifold Alignment. IJCAI 2011: 1541-1546 | |
| c58 | Chang Wang, Sridhar Mahadevan: Jointly Learning Data-Dependent Label and Locality-Preserving Projections. IJCAI 2011: 1547-1552 | |
| c57 | Blake Foster, Sridhar Mahadevan, Rui Wang: A GPU-Based Approximate SVD Algorithm. PPAM (1) 2011: 569-578 | |
| 2010 | ||
| c56 | ||
| c55 | Georgios Theocharous, Sridhar Mahadevan: Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization. AAAI 2010 | |
| c54 | Sarah Osentoski, Sridhar Mahadevan: Basis function construction for hierarchical reinforcement learning. AAMAS 2010: 747-754 | |
| c53 | ||
| 2009 | ||
| j14 | Sridhar Mahadevan: Learning Representation and Control in Markov Decision Processes: New Frontiers. Foundations and Trends in Machine Learning 1(4): 403-565 (2009) | |
| j13 | Jeffrey Johns, Marek Petrik, Sridhar Mahadevan: Hybrid least-squares algorithms for approximate policy evaluation. Machine Learning 76(2-3): 243-256 (2009) | |
| c52 | Kimberly Ferguson, Beverly Park Woolf, Sridhar Mahadevan: Transfer Learning and Representation Discovery in Intelligent Tutoring Systems. AIED 2009: 605-607 | |
| c51 | ||
| c50 | Chang Wang, Sridhar Mahadevan: Multiscale Analysis of Document Corpora Based on Diffusion Models. IJCAI 2009: 1592-1597 | |
| c49 | Jeffrey Johns, Marek Petrik, Sridhar Mahadevan: Hybrid Least-Squares Algorithms for Approximate Policy Evaluation. ECML/PKDD (1) 2009: 9 | |
| 2008 | ||
| b1 | Sridhar Mahadevan: Representation Discovery using Harmonic Analysis. Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers 2008 | |
| c48 | Sridhar Mahadevan: Fast Spectral Learning using Lanczos Eigenspace Projections. AAAI 2008: 1472-1475 | |
| c47 | ||
| 2007 | ||
| j12 | Sridhar Mahadevan, Mauro Maggioni: Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes. Journal of Machine Learning Research 8: 2169-2231 (2007) | |
| j11 | Mohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchical Average Reward Reinforcement Learning. Journal of Machine Learning Research 8: 2629-2669 (2007) | |
| c46 | Jeffrey Johns, Sridhar Mahadevan, Chang Wang: Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization. AAAI 2007: 559-564 | |
| c45 | Ivon Arroyo, Kimberly Ferguson, Jeffrey Johns, Toby Dragon, Hasmik Meheranian, Don Fisher, Andrew G. Barto, Sridhar Mahadevan, Beverly Park Woolf: Repairing Disengagement With Non-Invasive Interventions. AIED 2007: 195-202 | |
| c44 | Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns, Kimberly Ferguson, Chang Wang: Learning to Plan Using Harmonic Analysis of Diffusion Models. ICAPS 2007: 224-231 | |
| c43 | Jeffrey Johns, Sridhar Mahadevan: Constructing basis functions from directed graphs for value function approximation. ICML 2007: 385-392 | |
| c42 | Sridhar Mahadevan: Adaptive mesh compression in 3D computer graphics using multiscale manifold learning. ICML 2007: 585-592 | |
| c41 | Sarah Osentoski, Sridhar Mahadevan: Learning state-action basis functions for hierarchical MDPs. ICML 2007: 705-712 | |
| 2006 | ||
| j10 | Mohammad Ghavamzadeh, Sridhar Mahadevan, Rajbala Makar: Hierarchical multi-agent reinforcement learning. Autonomous Agents and Multi-Agent Systems 13(2): 197-229 (2006) | |
| c40 | Sridhar Mahadevan, Mauro Maggioni, Kimberly Ferguson, Sarah Osentoski: Learning Representation and Control in Continuous Markov Decision Processes. AAAI 2006: 1194-1199 | |
| c39 | Mauro Maggioni, Sridhar Mahadevan: Fast direct policy evaluation using multiscale analysis of Markov diffusion processes. ICML 2006: 601-608 | |
| c38 | Kimberly Ferguson, Ivon Arroyo, Sridhar Mahadevan, Beverly Park Woolf, Andrew G. Barto: Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels. Intelligent Tutoring Systems 2006: 453-462 | |
| c37 | Jeffrey Johns, Sridhar Mahadevan, Beverly Park Woolf: Estimating Student Proficiency Using an Item Response Theory Model. Intelligent Tutoring Systems 2006: 473-480 | |
| 2005 | ||
| c36 | Jeffrey Johns, Sridhar Mahadevan: A Variational Learning Algorithm for the Abstract Hidden Markov Model. AAAI 2005: 9-14 | |
| c35 | Sridhar Mahadevan: Samuel Meets Amarel: Automating Value Function Approximation Using Global State Space Analysis. AAAI 2005: 1000-1005 | |
| c34 | ||
| c33 | Khashayar Rohanimanesh, Sridhar Mahadevan: Coarticulation: an approach for generating concurrent plans in Markov decision processes. ICML 2005: 720-727 | |
| c32 | Sridhar Mahadevan, Mauro Maggioni: Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions. NIPS 2005 | |
| c31 | Victoria Manfredi, Sridhar Mahadevan, James F. Kurose: Switching kalman filters for prediction and tracking in an adaptive meteorological sensing network. SECON 2005: 197-206 | |
| c30 | ||
| 2004 | ||
| c29 | Suchi Saria, Sridhar Mahadevan: Probabilistic Plan Recognition in Multiagent Systems. ICAPS 2004: 287-296 | |
| c28 | Mohammad Ghavamzadeh, Sridhar Mahadevan: Learning to Communicate and Act Using Hierarchical Reinforcement Learning. AAMAS 2004: 1114-1121 | |
| c27 | Sarah Osentoski, Victoria Manfredi, Sridhar Mahadevan: Learning hierarchical models of activity. IROS 2004: 891-896 | |
| c26 | Khashayar Rohanimanesh, Robert Platt Jr., Sridhar Mahadevan, Roderic A. Grupen: Coarticulation in Markov Decision Processes. NIPS 2004 | |
| 2003 | ||
| j9 | Andrew G. Barto, Sridhar Mahadevan: Recent Advances in Hierarchical Reinforcement Learning. Discrete Event Dynamic Systems 13(1-2): 41-77 (2003) | |
| j8 | Andrew G. Barto, Sridhar Mahadevan: Recent Advances in Hierarchical Reinforcement Learning. Discrete Event Dynamic Systems 13(4): 341-379 (2003) | |
| c25 | Mohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchical Policy Gradient Algorithms. ICML 2003: 226-233 | |
| 2002 | ||
| c24 | Mohammad Ghavamzadeh, Sridhar Mahadevan: A multiagent reinforcement learning algorithm by dynamically merging markov decision processes. AAMAS 2002: 845-846 | |
| c23 | Mohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchically Optimal Average Reward Reinforcement Learning. ICML 2002: 195-202 | |
| c22 | Georgios Theocharous, Sridhar Mahadevan: Approximate Planning with Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation. ICRA 2002: 1347-1352 | |
| c21 | Khashayar Rohanimanesh, Sridhar Mahadevan: Learning to Take Concurrent Actions. NIPS 2002: 1619-1626 | |
| c20 | ||
| 2001 | ||
| c19 | Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamzadeh: Hierarchical multi-agent reinforcement learning. Agents 2001: 246-253 | |
| c18 | Silviu Minut, Sridhar Mahadevan: A reinforcement learning model of selective visual attention. Agents 2001: 457-464 | |
| c17 | Mohammad Ghavamzadeh, Sridhar Mahadevan: Continuous-Time Hierarchical Reinforcement Learning. ICML 2001: 186-193 | |
| c16 | Georgios Theocharous, Khashayar Rohanimanesh, Sridhar Mahadevan: Learning Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation. ICRA 2001: 511-516 | |
| c15 | Khashayar Rohanimanesh, Sridhar Mahadevan: Decision-Theoretic Planning with Concurrent Temporally Extended Actions. UAI 2001: 472-479 | |
| 2000 | ||
| c14 | Silviu Minut, Sridhar Mahadevan, John M. Henderson, Fred C. Dyer: Face Recognition Using Foveal Vision. Biologically Motivated Computer Vision 2000: 424-433 | |
| c13 | Natalia Hernandez-Gardiol, Sridhar Mahadevan: Hierarchical Memory-Based Reinforcement Learning. NIPS 2000: 1047-1053 | |
| 1999 | ||
| c12 | ||
| 1998 | ||
| j7 | Sridhar Mahadevan, Georgios Theocharous, Nikfar Khaleeli: Rapid Concept Learning for Mobile Robots. Auton. Robots 5(3-4): 239-251 (1998) | |
| j6 | Sridhar Mahadevan, Georgios Theocharous, Nikfar Khaleeli: Rapid Concept Learning for Mobile Robots. Machine Learning 31(1-3): 7-27 (1998) | |
| c11 | Sridhar Mahadevan, Georgios Theocharous: Optimizing Production Manufacturing Using Reinforcement Learning. FLAIRS Conference 1998: 372-377 | |
| 1996 | ||
| j5 | Sridhar Mahadevan, Leslie Pack Kaelbling: The National Science Foundation Workshop on Reinforcement Learning. AI Magazine 17(4): 89-93 (1996) | |
| j4 | Sridhar Mahadevan: Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results. Machine Learning 22(1-3): 159-195 (1996) | |
| c10 | Sridhar Mahadevan: An Average-Reward Reinforcement Learning Algorithm for Computing Bias-Optimal Policies. AAAI/IAAI, Vol. 1 1996: 875-880 | |
| c9 | Sridhar Mahadevan: Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning. ICML 1996: 328-336 | |
| 1994 | ||
| j3 | Sridhar Mahadevan, Prasad Tadepalli: Quantifying Prior Determination Knowledge Using the PAC Learning Model. Machine Learning 17(1): 69-105 (1994) | |
| c8 | Sridhar Mahadevan: To Discount or Not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning. ICML 1994: 164-172 | |
| 1993 | ||
| j2 | Sridhar Mahadevan, Tom M. Mitchell, Jack Mostow, Louis I. Steinberg, Prasad Tadepalli: An Apprentice-Based Approach to Knowledge Acquisition. Artif. Intell. 64(1): 1-52 (1993) | |
| 1992 | ||
| j1 | Sridhar Mahadevan, Jonathan Connell: Automatic Programming of Behavior-Based Robots Using Reinforcement Learning. Artif. Intell. 55(2): 311-365 (1992) | |
| c7 | Sridhar Mahadevan: Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions. ML 1992: 290-299 | |
| 1991 | ||
| c6 | Sridhar Mahadevan, Jonathan Connell: Automatic Programming of Behavior-Based Robots Using Reinforcement Learning. AAAI 1991: 768-773 | |
| c5 | Sridhar Mahadevan, Jonathan Connell: Scaling Reinforcement Learning to Robotics by Exploiting the Subsumption Architecture. ML 1991: 328-332 | |
| 1989 | ||
| c4 | Sridhar Mahadevan: Using Determinations in EBL: A Solution to the incomplete Theory Problem. ML 1989: 320-325 | |
| 1988 | ||
| c3 | Sridhar Mahadevan, Prasad Tadepalli: On the Tractability of Learning from Incomplete Theories. ML 1988: 235-241 | |
| 1985 | ||
| c2 | Tom M. Mitchell, Sridhar Mahadevan, Louis I. Steinberg: LEAP: A Learning Apprentice for VLSl Design. IJCAI 1985: 573-580 | |
| c1 | Sridhar Mahadevan: Verification-based Learning: A Generalized Strategy for Inferring Problem-Reduction Methods. IJCAI 1985: 616-623 | |
Colors in the list of coauthors
Last update Sat May 25 05:13:26 2013 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page