top of page
Completed
MIT Koch Institute
TIMELINE
FIELD
ROLE
2026-present
Data science &
Student Researcher
Computational biology
STATUS
In-progress
OVERVIEW
WHAT I DID
My goal, as described above, is to perform data processing to feed the model. For this dataset, however, I also provided visuals and more biological context to show the reliability and overall setup of procedure, which involves:
1. Setup - importing appropriate libraries, reading in raw data files
2. Creating a violin plot to demonstrate dataset health, where the amount of useless/error data is expressed
3. Eliminating biologically useless/error data
4. Cleaning the data and preparing it for visual purposes
5. Calling upon different genes in the dataset to view their relative location in the plot
6. Classifying different regions of the cell data through that observation of gene clusters, looking out for expression of genes like Ptprc, which signals immune cell interference (not the intestinal cells we are focused on)



TAKEAWAYS
bottom of page