Corporate Projects
Two years of engineering co-op at Collibra, a $5-billion enterprise intelligence and data governance company listed as the 7th most valuable data startup in the world, serving a wide spectrum of clients, including Apple, Adobe, Equifax, AXA, L'Oreal, Orange, Moody's, Freddie Mac and BNP Paribas.
-
Machine Learning Engineer 2020Knowledge Graph
Researched deep learning approaches for tabular representation learning. Models were trained to learn privacy preserving representations which could be used for entity deduplication, without client data leaving the customers' premises.
Developed a data collection pipeline to transform an unstructured corpus of academic and business journals into a domain specific knowledge graph, using active learning, clustering and topic modeling for a data mining research project with Collibra-partner UC San Diego BlockLAB.
Keras, word2vec, spaCy, Gensim, NLTK, NumPy, TensorBoard -
Machine Learning Engineer 2021Business Process Automation
Developed internal tool to automate issue/feature backlog prioritization and assignment. Designed custom issue/feedback forms for ease of feature extraction.
Created and deployed a data pipeline to aggregate issues from engineering (Jira) and customer (Aha!) backlogs, and contextualize issues using customer and product metadata from various sources (Salesforce, Confluence, GitHub, etc).
Prototyped a ranking model to automate allocation and prioritization of issues.
Github Actions, Pydantic, Beautiful Soup, Collibra Data Catalog