Publications

2025

  • Ellis, S., Song, S., Reiman, D., Hui, X., Zhang, R., Shahriar, M. H., … & Ahsan, H. (2025). Improved Diagnosis of Non-Melanoma Skin Cancer in Resource-Limited Settings. Cancer Epidemiology, Biomarkers & Prevention.

  • Song, S., Borjigin-Wang, M., Madejski, I., & Grossman, R. L. (2025). Multimodal Cancer Modeling in the Age of Foundation Model Embeddings. arXiv [cs.LG]. Retrieved from http://arxiv.org/abs/2505.07683

  • Song, S., Subramanyam, A., Zhang, Z., Venkat, A., & Grossman, R. L. (2025, November). GDC Cohort Copilot: an AI copilot for curating cohorts from the Genomic Data Commons. Bioinformatics Advances, 5(1), vbaf295. https://doi.org/10.1093/bioadv/vbaf295
  • Subramanyam, A., Chen, Y., & Grossman, R. L. (2025). Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining. arXiv [cs.LG]. Retrieved from http://arxiv.org/abs/2510.03313
  • Venkat, A., Wysocki, W. P., Lukowski, M., Song, S., Subramanyam, A., Zhang, Z., & Grossman, R. L. (2025). Query Augmented Generation (QAG) from the Genomic Data Commons for Accurate Variant Statistics. bioRxiv. doi:10.1101/2025.09.02.673797

 

2024

  • Grossman, R. L., Boyd, C., Do, N., Elbers, D. C., Fitzsimons, M. S., Giger, M. L., … & Venkat, A. (2024). Ten Pillars for Data Meshes. arXiv preprint arXiv:2411.05248.
  • Song, S., Subramanyam, A., Madejski, I., & Grossman, R. L. (2024). LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation. arXiv preprint arXiv:2411.16523.