Pooja Yadav

Watching Sundar Pichai talk about ‘Why Machine Learning is the future’ gave me a feeling of consciousness, back in 2017. 'Better computational power and access to a lot of relevant data are the two main causes of the progress in ML' he said. I've always felt this branch of computer science to be exciting, be it data analysis, or data visualization and emphasizing it with ML algorithms or data storage. My immediate goal after masters is to become a Data Scientist. In the foreseable future, I see myself becoming a Data Scientist as I am highly inspired by D J Patil, Former Chief Data Scientist at the White House, first ever chief data scientist appointed by US government. His pointers about striving for curiosity, team building, social and local problem solving always inspire me to crave for becoming a CDS. I am a motivated and a versatile engineer with demonstrated experience in Machine Learning and Data Analysis domain. Experienced in data extraction, developing ML algorithms and developing final predictive model. I am a strong problem solver, have excellent communication skills and dedicated to exceed goals and expectations.

Academics

Academics not only helps us get new and better jobs but also brings out the positive potentials within us.

MASTER OF SCIENCE (MS)

San Diego State University, Computer Science
Aug 2023

Courses

Advanced Algorithms (CS-660), Machine Learning (CS-549), Big Data Analytics (CS-649) Principles and Techniques of Data Science (CS-577), Advanced Object Oriented Programming (CS-635), Research (CS-797)

BACHELOR OF ENGINEERING (BE)

University of Mumbai, Computer Engineering
June 2019

Courses

Data Structures & Algorithms, Operations Research, Data Mining and Warehousing, Image Processing, Computer Organization & Architecture, Object Oriented Programming, Computer Graphics, Theory of computer Science, Database Management System, Software Engineering, Computer Networks, Design and Analysis Algorithm

Internships

Internships give the first hand exposure working in the real world. It allows us to implement skill, knowlegde and the theoretical practice learnt before. I beleive that it is a learning curve with little experience of the professional world.

Projects

I am of the opinion that hands on coding or projects are the basic literacy in the digital world. It is really important for a programmer to understand and be able to work with the technology around him or her. I have been coding since my engineering and below are a few examples to demonstrate my knowlegde, creativity, math and confidence.

Ethio Hydro

Developed a web application to analyse and monitor 67 million rows of rain and temperature Time Series geospatial data. Implemented parallel processing to speed up the data manipulation and cleaning process by 80%. Extracted location and visualized geospatial data. Created bi-directional communication on a folium map using haversine distance to cut down the latitude-longitude search time by 20% and deployed the application with docker container. Technologies used: Streamlit, Python, Pandas, NumPy, geoPy, AWS, Swifter, Pandarallel, Plotly, Altair Charts, and Folium

New York City Airbnb Rent prediction

Worked with Airbnb historical data from Kaggle to predict it's rent in the NewYork city using Multiple Linear Regression with an accuracy of 55%, performed Exploratory Data Analysis using Pandas, NumPy, and Seaborn. Applied XGBoost to improve the model accuracy from 55% to 85% Encoded different categorical data using one-hot encoding and treated missing values using mean imputation

Titanic Survival Prediction

Explored the Titanic disaster dataset from Kaggle and applied Logistic Regression to predict the survival of the Titanic passengers performed Feature Selection, Exploratory Data Analysis, and mean imputation using Pandas and NumPy implemented one-hot encoding, K-fold cross-validation, and Hyperparameter Tuning improving the model accuracy from 62% to 77%

Word Processor

Implemented a Word Processor using Flyweight System design pattern in python3, that given a unicode code point returns the Flyweight character object for the character and font

In-Memory Database

Explored how to implement In-Memory database in python3 using Command and Memento system design pattern which helped me understand System design pattern in depth.

Uber Data Analytics

Modeled and Implemented an ETL pipeline utilizing Python, Google Compute Engine, and mage-ai to efficiently extract, transform, and load Uber data into BigQuery. Created an interactive Looker Studio dashboard to perform an in-depth analysis of Uber data based on different parameters. Implemented filters and drill-down capabilities, enhancing user experience and enabling rapid trend identification and opportunities for optimization

Skills & Organizations

Skills are a important aspect of a person's career as they define your ability in your domain. In my opinion, with skills and knowledge, socializing is also an important aspect. Throughout my academic career, I have joined a lot of labs and organizations for improving my self confidence, public speaking and networking. Technical as well as Non- technical knowledge are important when it comes to thriving in the society. Here are a few of my skills and the organizations that I have joined uptil now.

Get in touch

How about getting connected?
You can contact me anywhere on the links provided below.