Google Scholar

User profiles for Shipra Agrawal

Shipra Agrawal

- Verified email at columbia.edu - Cited by 6519

Shipra Agrawal

- Verified email at nationwidechildrens.org - Cited by 1574

[PDF] mlr.press

Analysis of thompson sampling for the multi-armed bandit problem

S Agrawal, N Goyal - Conference on learning theory, 2012 - proceedings.mlr.press

The multi-armed bandit problem is a popular model for studying exploration/exploitation
trade-off in sequential decision problems. Many algorithms are now available for this well-…

Save Cite Cited by 1500 Related articles All 15 versions View as HTML

[PDF] mlr.press

Thompson sampling for contextual bandits with linear payoffs

S Agrawal, N Goyal - International conference on machine …, 2013 - proceedings.mlr.press

Thompson Sampling is one of the oldest heuristics for multi-armed bandit problems. It is a
randomized algorithm based on Bayesian ideas, and has recently generated significant …

Save Cite Cited by 1136 Related articles All 11 versions View as HTML

[PDF] arxiv.org

Bandits with concave rewards and convex knapsacks

S Agrawal, NR Devanur - Proceedings of the fifteenth ACM conference …, 2014 - dl.acm.org

In this paper, we consider a very general model for exploration-exploitation tradeoff which
allows arbitrary concave rewards and convex constraints on the decisions across time, in …

Save Cite Cited by 212 Related articles All 6 versions

[PDF] arxiv.org

A framework for high-accuracy privacy-preserving mining

S Agrawal, JR Haritsa - 21st International Conference on Data …, 2005 - ieeexplore.ieee.org

To preserve client privacy in the data mining process, a variety of techniques based on
random perturbation of individual data records have been proposed recently. In this paper, we …

Save Cite Cited by 239 Related articles All 10 versions

[PDF] mlr.press

Near-optimal regret bounds for thompson sampling

S Agrawal, N Goyal - Journal of the ACM (JACM), 2017 - dl.acm.org

Thompson Sampling (TS) is one of the oldest heuristics for multiarmed bandit problems. It is
a randomized algorithm based on Bayesian ideas and has recently generated significant …

Save Cite Cited by 641 Related articles All 15 versions

[HTML] oup.com Full View

piRNABank: a web resource on classified and clustered Piwi-interacting RNAs

S Sai Lakshmi, S Agrawal - Nucleic acids research, 2008 - academic.oup.com

Piwi-interacting RNAs (piRNAs) are expressed in mammalian germline cells and have been
identified as key players in germline development. These molecules, typically of length 25–…

Save Cite Cited by 392 Related articles All 17 versions

[PDF] arxiv.org

A dynamic near-optimal algorithm for online linear programming

S Agrawal, Z Wang, Y Ye - Operations Research, 2014 - pubsonline.informs.org

A natural optimization model that formulates many online resource allocation problems is the
online linear programming (LP) problem in which the constraint matrix is revealed column …

Save Cite Cited by 346 Related articles All 20 versions

[HTML] nih.gov

Dyslipidaemia in nephrotic syndrome: mechanisms and treatment

S Agrawal, JJ Zaritsky, A Fornoni… - Nature Reviews …, 2018 - nature.com

Nephrotic syndrome is a highly prevalent disease that is associated with high morbidity
despite notable advances in its treatment. Many of the complications of nephrotic syndrome, …

Save Cite Cited by 287 Related articles All 16 versions

[PDF] neurips.cc

Optimistic posterior sampling for reinforcement learning: worst-case regret bounds

S Agrawal, R Jia - Advances in Neural Information …, 2017 - proceedings.neurips.cc

We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves
near-optimal worst-case regret bounds when the underlying Markov Decision Process (…

Save Cite Cited by 226 Related articles All 14 versions View as HTML

[PDF] mlr.press

Reinforcement learning for integer programming: Learning to cut

Y Tang, S Agrawal, Y Faenza - International conference on …, 2020 - proceedings.mlr.press

Integer programming is a general optimization framework with a wide variety of applications,
eg, in scheduling, production planning, and graph optimization. As Integer Programs (IPs) …

Save Cite Cited by 187 Related articles All 8 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

User profiles for Shipra Agrawal

Shipra Agrawal

Shipra Agrawal

Analysis of thompson sampling for the multi-armed bandit problem

Thompson sampling for contextual bandits with linear payoffs

Bandits with concave rewards and convex knapsacks

A framework for high-accuracy privacy-preserving mining

Near-optimal regret bounds for thompson sampling

piRNABank: a web resource on classified and clustered Piwi-interacting RNAs

A dynamic near-optimal algorithm for online linear programming

Dyslipidaemia in nephrotic syndrome: mechanisms and treatment

Optimistic posterior sampling for reinforcement learning: worst-case regret bounds

Reinforcement learning for integer programming: Learning to cut

Related searches