2025
-
Ellis, S., Song, S., Reiman, D., Hui, X., Zhang, R., Shahriar, M. H., … & Ahsan, H. (2025). Improved Diagnosis of Non-Melanoma Skin Cancer in Resource-Limited Settings. Cancer Epidemiology, Biomarkers & Prevention.
-
Song, S., Borjigin-Wang, M., Madejski, I., & Grossman, R. L. (2025). Multimodal Multimodal Cancer Modeling in the Age of Foundation Model Embeddings. arXiv [Cs.LG]. Retrieved from http://arxiv.org/abs/2505.07683
- Song, S., Subramanyam, A., Zhang, Z., Venkat, A., & Grossman, R. L. (11 2025). GDC Cohort Copilot: an AI copilot for curating cohorts from the genomic data commons. Bioinformatics Advances, 5(1), vbaf295. https://doi.org/10.1093/bioadv/vbaf295
- Subramanyam, A., Chen, Y., & Grossman, R. L. (2025). Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining. arXiv [Cs.LG]. Retrieved from http://arxiv.org/abs/2510.03313
-
Venkat, A., Wysocki, W. P., Lukowski, M., Song, S., Subramanyam, A., Zhang, Z., & Grossman, R. L. (2025). Query Augmented Generation (QAG) from the Genomic Data Commons for Accurate Variant Statistics. bioRxiv. doi:10.1101/2025.09.02.673797
2024
- Grossman, R. L., Boyd, C., Do, N., Elbers, D. C., Fitzsimons, M. S., Giger, M. L., … & Venkat, A. (2024). Ten Pillars for Data Meshes. arXiv preprint arXiv:2411.05248.
- Song, S., Subramanyam, A., Madejski, I., & Grossman, R. L. (2024). LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation. arXiv preprint arXiv:2411.16523.