Sipu Hou

Sipu Hou

Data Analyst

@sipu-h

Joined Sep 2020

San Francisco, CA, USA

About

· Master’s degree in Business Analytics and data analyst with 3 years of work experience

· Expertise and hands-on experience in SQL, Tableau, Python, R, Excel, data analysis, machine learning and data mining

Experiences

22yrs 7mos

M.S. in Business Analytics

Sep 2017 - May 2020

2yrs 8mos

Sep 2017 - May 2020

2yrs 8mos

See more

**Designed a database model for real estate company** * Analyzed business process, created a Relational Data Model for the real estate company, including all Entities, Attributes, Data Relationships, Primary and Foreign key Structures. * Developed database tables for each of these entities in the model using SQL statements, and utilized SQL queries to retrieve and analyze data. * Created reports and forms using MS Access. **Analyzed the medical insurance cost using** **Tableau and R programming** * Used the boxplot to analyze the medical expenses against other categorical factors. * Applied Multiple Linear Regression Model to build a regression model and evaluate it. * Generated and improved model to analyze the relationship between medical insurance cost and other variables. **Implemented funnel analysis for an e-commerce website using Tableau and Python programming** * Segmented data by device, gender and the day of the week. * Calculated the lead-to-customer conversion rate and customer retention rate. * Built dashboards, drawn meaningful insights and made recommendations based on the descriptive modeling. **Performed the bank marketing analysis using Tableau and Python programming** * Used Python to retrieve data and clean data from structured data files. * Utilized Tableau and Seaborn package in Python to make data visualization. * Split the dataset into validation data and training data by portion 30%: 70%. * Performed forecasting using Naïve Bayes algorithm and Random Forest algorithm in repaying loan.
MySQL

MySQL

Microsoft Access

Microsoft Access

Microsoft SQL Server Manage...

Microsoft SQL Server Manage...

Python

Python

Jupyter

Jupyter

RStudio

RStudio

Microsoft Excel

Microsoft Excel

Google Analytics

Google Analytics

Google Sheets

Google Sheets

Tableau

Tableau

Pandas

Pandas

NumPy

NumPy

ggplot2

ggplot2

Matplotlib

Matplotlib

scikit-learn

scikit-learn

Data Analyst

Dec 2013 - Feb 2017

3yrs 2mos

Dec 2013 - Feb 2017

3yrs 2mos

See more

· Developed and maintained relational databases to record transactions of the company. · Wrote SQL queries to extract data from MySQL relational databases, and then provided custom reports to supervisor. · Constructed dashboards using Microsoft Excel to track seasonality including key insights such as monthly active user (MAU), retention rate and revenue. · Built optimization models and applied them to coal cleaning processes, which improved the revenue by 13%.
Microsoft Excel

Microsoft Excel

MySQL

MySQL

Microsoft Access

Microsoft Access

Lecturer and Researcher

Jul 2004 - Jun 2007

2yrs 11mos

Jul 2004 - Jun 2007

2yrs 11mos

See more

· Prepared and delivered lectures for 24 classes of roughly 900 students. · Utilized Excel to visualize student test score data via pivot table and clustered column charts. · Calculated the successful rate of each question in examinations and built the dashboards.
Microsoft Excel

Microsoft Excel

Research Assistant

Sep 2001 - Jul 2004

2yrs 10mos

Sep 2001 - Jul 2004

2yrs 10mos

See more

· Collected and cleaned data with Excel, and then analyzed data using Pivot Tables and charts. · Used Origin software to create scientific graphs and make model. · Assisted with duties related to the production of academic journals.
Microsoft Excel

Microsoft Excel

M.S. in Condensed Matter Physics

Sep 2001 - Jul 2004

2yrs 10mos

Sep 2001 - Jul 2004

2yrs 10mos

See more

Microsoft Excel

Microsoft Excel

B.S. in Physics Education

Sep 1997 - Jul 2001

3yrs 10mos

Sep 1997 - Jul 2001

3yrs 10mos

Tech Stack

Spreadsheets Online
Microsoft Excel

Microsoft Excel

Senior

Google Sheets

Google Sheets

Junior

Databases
MySQL

MySQL

Senior

Microsoft Access

Microsoft Access

Mid-level

Microsoft SQL Server

Microsoft SQL Server

Junior

Languages
Python

Python

Mid-level

Text Editor
RStudio

RStudio

Mid-level

Business Intelligence
Tableau

Tableau

Mid-level

Productivity Suite
Microsoft Office 365

Microsoft Office 365

Mid-level

Data Notebooks
Jupyter

Jupyter

Mid-level

Operating Systems
Windows 10

Windows 10

Mid-level

macOS

macOS

Mid-level

Data Science
Pandas

Pandas

Mid-level

NumPy

NumPy

Mid-level

Charting Libraries
ggplot2

ggplot2

Mid-level

Matplotlib

Matplotlib

Mid-level

General Analytics
Google Analytics

Google Analytics

Junior

Database Tools
Microsoft SQL Server Manage...

Microsoft SQL Server Manage...

Junior

Machine Learning
scikit-learn

scikit-learn

Beginner

Copyright © 2025 Sipu Hou

Built with Showwcase