A prototype-oriented framework for unsupervised domain adaptation K Tanwisuth, X Fan, H Zheng, S Zhang, H Zhang, B Chen, M Zhou Advances in Neural Information Processing Systems 34, 17194-17208, 2021 | 108 | 2021 |
Fusedream: Training-free text-to-image generation with improved clip+ gan space optimization X Liu, C Gong, L Wu, S Zhang, H Su, Q Liu arXiv preprint arXiv:2112.01573, 2021 | 82 | 2021 |
AutoML-GPT: Automatic Machine Learning with GPT S Zhang, C Gong, L Wu, X Liu, M Zhou arXiv preprint arXiv:2305.02499, 2023 | 79 | 2023 |
Bayesian attention modules X Fan, S Zhang, B Chen, M Zhou Advances in Neural Information Processing Systems 33, 16362-16376, 2020 | 70 | 2020 |
Knowing more about questions can help: Improving calibration in question answering S Zhang, C Gong, E Choi arXiv preprint arXiv:2106.01494, 2021 | 49 | 2021 |
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models K Tanwisuth, S Zhang, H Zheng, P He, M Zhou International Conference on Machine Learning, 33816-33832, 2023 | 39 | 2023 |
Learning from uneven training data: Unlabeled, single label, and multiple labels S Zhang, C Gong, E Choi arXiv e-prints, arXiv: 2109.04408, 2021 | 38* | 2021 |
ALLSH: Active Learning Guided by Local Sensitivity and Hardness S Zhang, C Gong, X Liu, P He, W Chen, M Zhou arXiv preprint arXiv:2205.04980, 2022 | 36 | 2022 |
Contextual dropout: An efficient sample-dependent dropout module X Fan, S Zhang, K Tanwisuth, X Qian, M Zhou arXiv preprint arXiv:2103.04181, 2021 | 34 | 2021 |
Bayesian attention belief networks S Zhang, X Fan, B Chen, M Zhou International Conference on Machine Learning, 12413-12426, 2021 | 32 | 2021 |
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems Y Feng, S Yang, S Zhang, J Zhang, C Xiong, M Zhou, H Wang arXiv preprint arXiv:2302.10342, 2023 | 22 | 2023 |
A unified framework for alternating offline model training and policy learning S Yang, S Zhang, Y Feng, M Zhou Advances in Neural Information Processing Systems 35, 17216-17232, 2022 | 13 | 2022 |
WPO: Enhancing RLHF with Weighted Preference Optimization W Zhou, R Agrawal, S Zhang, SR Indurthi, S Zhao, K Song, S Xu, C Zhu Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 11 | 2024 |
Preference-grounded token-level guidance for language model fine-tuning S Yang, S Zhang, C Xia, Y Feng, C Xiong, M Zhou Advances in Neural Information Processing Systems 36, 2024 | 11 | 2024 |
FlowGrad: Controlling the Output of Generative ODEs with Gradients X Liu, L Wu, S Zhang, C Gong, W Ping, Q Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 11 | 2023 |
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning S Yang, Y Feng, S Zhang, M Zhou International Conference on Machine Learning, 24980-25006, 2022 | 11 | 2022 |
Alignment attention by matching key and query distributions S Zhang, X Fan, H Zheng, K Tanwisuth, M Zhou Advances in Neural Information Processing Systems 34, 13444-13457, 2021 | 11 | 2021 |
Passage-Mask: A Learnable Regularization Strategy for Retriever-Reader Models S Zhang, C Gong, X Liu Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 10 | 2022 |
Capturing label distribution: A case study in nli S Zhang, C Gong, E Choi arXiv preprint arXiv:2102.06859, 2021 | 9 | 2021 |
Sliced Wasserstein with random-path projecting directions K Nguyen, S Zhang, T Le, N Ho Proceedings of the ICML, 2024, 2024 | 4 | 2024 |