Follow
Daniil Tiapkin
Daniil Tiapkin
Other namesDaniil Tyapkin, Daniil Nikolaevich Tyapkin
Verified email at polytechnique.edu - Homepage
Title
Cited by
Cited by
Year
Improved complexity bounds in wasserstein barycenter problem
D Dvinskikh, D Tiapkin
International Conference on Artificial Intelligence and Statistics, 1738-1746, 2021
282021
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D Tiapkin, D Belomestny, E Moulines, A Naumov, S Samsonov, Y Tang, ...
International Conference on Machine Learning, 21380-21431, 2022
152022
Stochastic saddle-point optimization for the Wasserstein barycenter problem
D Tiapkin, A Gasnikov, P Dvurechensky
Optimization Letters 16 (7), 2145-2175, 2022
122022
Generative Flow Networks as Entropy-Regularized RL
D Tiapkin, N Morozov, A Naumov, D Vetrov
AISTATS-2024, 2023
112023
Fast Rates for Maximum Entropy Exploration
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
International Conference on Machine Learning, 2023
92023
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
Neural Information Processing Systems, 2022
92022
Primal-Dual Stochastic Mirror Descent for MDPs
D Tiapkin, A Gasnikov
International Conference on Artificial Intelligence and Statistics, 9723-9740, 2022
92022
Demonstration-Regularized RL
D Tiapkin, D Belomestny, D Calandriello, E Moulines, A Naumov, ...
ICLR-2024, 2023
7*2023
Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold
S Schechtman, D Tiapkin, M Muehlebach, E Moulines
The Thirty Sixth Annual Conference on Learning Theory, 1228-1258, 2023
62023
Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability
S Samsonov, D Tiapkin, A Naumov, E Moulines
The Thirty Seventh Annual Conference on Learning Theory, 4511-4547, 2024
5*2024
Incentivized Learning in Principal-Agent Bandit Games
A Scheid, D Tiapkin, E Boursier, A Capitaine, EME Mhamdi, É Moulines, ...
arXiv preprint arXiv:2403.03811, 2024
32024
First-Order Constrained Optimization: Non-smooth Dynamical System Viewpoint
S Schechtman, D Tiapkin, E Moulines, MI Jordan, M Muehlebach
IFAC-PapersOnLine 55 (16), 236-241, 2022
32022
Model-free Posterior Sampling via Learning Rate Randomization
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
Advances in Neural Information Processing Systems 36, 2024
12024
A New Bound on the Cumulant Generating Function of Dirichlet Processes
P Perrault, D Belomestny, P Ménard, É Moulines, A Naumov, D Tiapkin, ...
arXiv preprint arXiv:2409.18621, 2024
2024
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
D Tiapkin, E Chzhen, G Stoltz
arXiv preprint arXiv:2407.05704, 2024
2024
Improving GFlowNets with Monte Carlo Tree Search
N Morozov, D Tiapkin, S Samsonov, A Naumov, D Vetrov
arXiv preprint arXiv:2406.13655, 2024
2024
On the structure of the set of panchromatic colorings of a random hypergraph
DN Tyapkin, DA Shabanov
Doklady Mathematics 108 (1), 286-290, 2023
2023
Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms
D Belomestny, P Menard, A Naumov, D Tiapkin, M Valko
arXiv preprint arXiv:2304.03056, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–18