Pivoting from Infra to ML Research
For the last few years, I’ve worked on data & ML infra: pipelines and orchestration for peta-bytes of data. Long-term though, I’m hoping to work closer to the models, methods, and ideas. I have some ideas in both research (tabular data / semi-structured data / document processing) and open-source (Rust ML) that I want to spend my time on. The bar for research is high and the field has evolved immensely since I was an undergrad. This repo will hopefully get me to the starting line.
Repo Link: peterlee123hi/ml-foundations