Baharan Mirzasoleiman
- Papers: 8
Authored papers
Synthetic Text Generation for Training Large Language Models via Gradient Matching
arXiv 2025
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
arXiv 2025
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
arXiv 2024
Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
arXiv 2024
Mini-batch Coresets for Memory-efficient Training of Large Language Models
arXiv 2024
Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least
arXiv 2023
Towards Sustainable Learning: Coresets for Data-efficient Deep Learning
arXiv 2023
Data-Efficient Augmentation for Training Neural Networks
Affiliations
Frequent co-authors
10 from 8 papers