Please note: This is a beta version of the new dblp website.
You can find the classic dblp view of this page here.
You can find the classic dblp view of this page here.
Shalabh Bhatnagar
2010 – today
- 2012
[j35]H. L. Prasad, Shalabh Bhatnagar: General-sum stochastic games: Verifiability conditions for Nash equilibria. Automatica 48(11): 2923-2930 (2012)
[j34]Koteswara Rao Vemu, Shalabh Bhatnagar, N. Hemachandra: Optimal multi-layered congestion based pricing schemes for enhanced QoS. Computer Networks 56(4): 1249-1262 (2012)
[j33]Shalabh Bhatnagar, K. Lakshmanan: An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. J. Optimization Theory and Applications 153(3): 688-708 (2012)
[j32]L. A. Prashanth, Shalabh Bhatnagar: Threshold Tuning Using Stochastic Optimization for Graded Signal Control. IEEE T. Vehicular Technology 61(9): 3865-3880 (2012)
[c17]Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar: q-Gaussian based Smoothed Functional algorithms for stochastic optimization. ISIT 2012: 1059-1063
[i6]Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar: q-Gaussian based Smoothed Functional Algorithm for Stochastic Optimization. CoRR abs/1202.5665 (2012)
[i5]Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar: Properties of Multivariate . CoRR abs/1206.4832 (2012)- 2011
[j31]Shalabh Bhatnagar: The Borkar-Meyn theorem for asynchronous stochastic approximations. Systems & Control Letters 60(7): 472-478 (2011)
[j30]Shalabh Bhatnagar, Vivek Kumar Mishra, N. Hemachandra: Stochastic Algorithms for Discrete Parameter Simulation Optimization. IEEE T. Automation Science and Engineering 8(4): 780-793 (2011)
[j29]Karmeshu, Shalabh Bhatnagar, Vivek Kumar Mishra: An Optimized SDE Model for Slotted Aloha. IEEE Transactions on Communications 59(6): 1502-1508 (2011)
[j28]L. A. Prashanth, Shalabh Bhatnagar: Reinforcement Learning With Function Approximation for Traffic Signal Control. IEEE Transactions on Intelligent Transportation Systems 12(2): 412-421 (2011)
[j27]Shalabh Bhatnagar, N. Hemachandra, Vivek Kumar Mishra: Stochastic approximation algorithms for constrained optimization via simulation. ACM Trans. Model. Comput. Simul. 21(3): 15 (2011)
[c16]K. Lakshmanan, Shalabh Bhatnagar: Smoothed Functional and Quasi-Newton Algorithms for Routing in Multi-stage Queueing Network with Constraints. ICDCIT 2011: 175-186
[c15]L. A. Prashanth, H. L. Prasad, Nirmit Desai, Shalabh Bhatnagar, Gargi Banerjee Dasgupta: Stochastic Optimization for Adaptive Labor Staffing in Service Systems. ICSOC 2011: 487-494- 2010
[j26]Shalabh Bhatnagar: An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes. Systems & Control Letters 59(12): 760-766 (2010)
[j25]Anshuk Chakraborty, Shalabh Bhatnagar: Optimized Policies for the Retransmission Probabilities in Slotted Aloha. Simulation 86(4): 247-261 (2010)
[j24]G. Ramana Reddy, Shalabh Bhatnagar, V. Rakesh, Vijay Prakash Chaturvedi: An efficient algorithm for scheduling in bluetooth piconets and scatternets. Wireless Networks 16(7): 1799-1816 (2010)
[c14]Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton: Toward Off-Policy Learning Control with Function Approximation. ICML 2010: 719-726
2000 – 2009
- 2009
[j23]Shalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Natural actor-critic algorithms. Automatica 45(11): 2471-2482 (2009)
[j22]Shalabh Bhatnagar, Rajesh Kumar Patro: A proof of convergence of the B-RED and P-RED algorithms for random early detection. IEEE Communications Letters 13(10): 809-811 (2009)
[j21]Rajesh Kumar Patro, Shalabh Bhatnagar: A probabilistic constrained nonlinear optimization framework to optimize RED parameters. Perform. Eval. 66(2): 81-104 (2009)
[j20]Shalabh Bhatnagar, Karmeshu, Vivek Kumar Mishra: Optimal parameter trajectory estimation in parameterized SDEs: An algorithmic procedure. ACM Trans. Model. Comput. Simul. 19(2) (2009)
[c13]Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári: LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. CDC 2009: 1181-1188
[c12]Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125
[c11]Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009: 1204-1212
[c10]Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195
[r1]P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar: Pattern Synthesis for Nonparametric Pattern Recognition. Encyclopedia of Data Warehousing and Mining 2009: 1511-1516- 2008
[j19]Shalabh Bhatnagar, K. Mohan Babu: New algorithms of the Q-learning type. Automatica 44(4): 1111-1119 (2008)
[j18]Sudha Velusamy, Lakshmi Gopal, Shalabh Bhatnagar, Sridhar Varadarajan: An efficient ad recommendation system for TV programs. Multimedia Syst. 14(2): 73-87 (2008)
[j17]Shalabh Bhatnagar, Mohammed Shahid Abdulla: Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes. Simulation 84(12): 577-600 (2008)
[c9]Sudha Velusamy, Shalabh Bhatnagar, S. V. Basavaraja, V. Sridhar: SPSA based feature relevance estimation for video retrieval. MMSP 2008: 598-603
[c8]Sudha Rani Kolavali, Shalabh Bhatnagar: Ant Colony Optimization Algorithms for Shortest Path Problems. NET-COOP 2008: 37-44- 2007
[j16]Mohammed Shahid Abdulla, Shalabh Bhatnagar: Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. Discrete Event Dynamic Systems 17(1): 23-52 (2007)
[j15]Ambedkar Dukkipati, Shalabh Bhatnagar, M. Narasimha Murty: Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals. Inf. Sci. 177(24): 5707-5714 (2007)
[j14]Shalabh Bhatnagar: Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization. ACM Trans. Model. Comput. Simul. 18(1) (2007)
[c7]Sudha Velusamy, Lakshmi Gopal, Sridhar Varadarajan, Shalabh Bhatnagar: Fuzzy Clustering Based Ad Recommendation for TV Programs. EuroITV 2007: 175-184
[c6]Vijay Prakash Chaturvedi, V. Rakesh, Shalabh Bhatnagar: An Efficient and Optimized Bluetooth Scheduling Algorithm for Piconets. ICDCIT 2007: 19-30
[c5]Koteswara Rao Vemu, Shalabh Bhatnagar, N. Hemachandra: An Optimal Weighted-Average Congestion Based Pricing Scheme for Enhanced QoS. ICDCIT 2007: 135-145
[c4]Shalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Incremental Natural Actor-Critic Algorithms. NIPS 2007- 2006
[j13]Shalabh Bhatnagar, J. Ranjan Panigrahi: Actor-critic algorithms for hierarchical Markov decision processes. Automatica 42(4): 637-644 (2006)
[j12]Shalabh Bhatnagar, Vivek S. Borkar, Madhukar Akarapu: A Simulation-Based Algorithm for Ergodic Control of Markov Chains Conditioned on Rare Events. Journal of Machine Learning Research 7: 1937-1962 (2006)
[j11]P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar: Partition based pattern synthesis technique with efficient algorithms for nearest neighbor classification. Pattern Recognition Letters 27(14): 1714-1724 (2006)
[c3]Mohammed Shahid Abdulla, Shalabh Bhatnagar: SPSA algorithms with measurement reuse. Winter Simulation Conference 2006: 320-328
[i4]Ambedkar Dukkipati, M. Narasimha Murty, Shalabh Bhatnagar: On Measure Theoretic definitions of Generalized Information Measures and Maximum Entropy Prescriptions. CoRR abs/cs/0601080 (2006)- 2005
[j10]P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar: Overlap pattern synthesis with an efficient nearest neighbor classifier. Pattern Recognition 38(8): 1187-1195 (2005)
[j9]Shalabh Bhatnagar, Hemant J. Kowshik: A Discrete Parameter Stochastic Approximation Algorithm for Simulation Optimization. Simulation 81(11): 757-772 (2005)
[j8]Shalabh Bhatnagar, I. Bala Bhaskar Reddy: Optimal Threshold Policies for Admission Control in Communication Networks via Discrete Parameter Stochastic Approximation. Telecommunication Systems 29(1): 9-31 (2005)
[j7]Shalabh Bhatnagar: Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization. ACM Trans. Model. Comput. Simul. 15(1): 74-107 (2005)
[c2]Ambedkar Dukkipati, M. Narasimha Murty, Shalabh Bhatnagar: Information theoretic justification of Boltzmann selection and its generalization to Tsallis case. Congress on Evolutionary Computation 2005: 1667-1674
[i3]Ambedkar Dukkipati, M. Narasimha Murty, Shalabh Bhatnagar: Uniqueness of Nonextensive entropy under Renyi's Recipe. CoRR abs/cs/0511078 (2005)- 2004
[j6]P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar: Fusion of multiple approximate nearest neighbor classifiers for fast and efficient classification. Information Fusion 5(4): 239-250 (2004)
[j5]Shalabh Bhatnagar, Shishir Kumar: A simultaneous perturbation stochastic approximation-based actor-critic algorithm for Markov decision processes. IEEE Trans. Automat. Contr. 49(4): 592-598 (2004)
[c1]P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar: A Pattern Synthesis Technique with an Efficient Nearest Neighbor Classifier for Binary Pattern Recognition. ICPR (4) 2004: 416-419
[i2]Ambedkar Dukkipati, M. Narasimha Murty, Shalabh Bhatnagar: Generalized Evolutionary Algorithm based on Tsallis Statistics. CoRR cs.AI/0407037 (2004)
[i1]Ambedkar Dukkipati, M. Narasimha Murty, Shalabh Bhatnagar: Cauchy Annealing Schedule: An Annealing Schedule for Boltzmann Selection Scheme in Evolutionary Algorithms. CoRR cs.AI/0408055 (2004)- 2003
[j4]Shalabh Bhatnagar, Vivek S. Borkar: Multiscale Chaotic SPSA and Smoothed Functional Algorithms for Simulation Optimization. Simulation 79(10): 568-580 (2003)
[j3]Shalabh Bhatnagar, Michael C. Fu, Steven I. Marcus, I-Jeng Wang: Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences. ACM Trans. Model. Comput. Simul. 13(2): 180-209 (2003)- 2002
[j2]Xi-Ren Cao, Zhiyuan Ren, Shalabh Bhatnagar, Michael C. Fu, Steven I. Marcus: A time aggregation approach to Markov decision processes. Automatica 38(6): 929-943 (2002)- 2001
[j1]Shalabh Bhatnagar, Michael C. Fu, Steven I. Marcus, Pedram Jaefari Fard: Optimal structured feedback policies for ABR flow control using two-timescale SPSA. IEEE/ACM Trans. Netw. 9(4): 479-491 (2001)
Coauthor Index
data released under the ODC-BY 1.0 license. See also our legal information page
last updated on 2013-01-18 20:03 CET by the dblp team



