Tianchi Cai
LLM Alignment, Minimax
Verified email at minimaxi.com
Title
Cited by
Year
Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses
Y Cao, T Cai, L Feng, L Gu, J Gu, B An, G Niu, M Sugiyama
Advances in Neural Information Processing Systems, 2022
Cited by 24, 2022
Joint incentive optimization of customer and merchant in mobile payment marketing
L Yu, Z Wu, T Cai, Z Liu, Z Zhang, L Gu, X Zeng, J Gu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (17), 15000 …, 2021
Cited by 10, 2021
ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference
T Cai, X Song, J Jiang, F Teng, J Gu, G Zhang
arXiv preprint arXiv:2312.02554, 2023
Cited by 4, 2023
Marketing budget allocation with offline constrained deep reinforcement learning
T Cai, J Jiang, W Zhang, S Zhou, X Song, L Yu, L Gu, X Zeng, J Gu, ...
Proceedings of the Sixteenth ACM International Conference on Web Search and …, 2023
Cited by 4, 2023
LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign
T Cai, D Cheng, C Liang, Z Liu, L Gu, H Xie, Z Zhang, X Zeng, J Gu
Database Systems for Advanced Applications: 26th International Conference …, 2021
Cited by 2, 2021
Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese
Y Xu, T Cai, J Jiang, X Song
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024
Cited by 1, 2024
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
T Cai, S Bao, J Jiang, S Zhou, W Zhang, L Gu, J Gu, G Zhang
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
Cited by 1, 2023
Robust Offline Reinforcement Learning from Low-Quality Data
W Shi, T Cai, S Song, L Gu, J Gu, G Huang
Cited by 1, 2020
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
T Cai, Z Tan, X Song, T Sun, J Jiang, Y Xu, Y Zhang, J Gu
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024
2024
Mitigate Position Bias with Coupled Ranking Bias on CTR Prediction
Y Zhao, Z Liu, T Cai, H Zhang, C Zhuang, J Gu
arXiv preprint arXiv:2405.18971, 2024
2024
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
T Cai, W Zhang, L Gu, X Zeng, J Gu
Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 38th …, 2021
2021