Skip to content

Statistica exploratory data analysis (EDA) of 600K+ Uber/Lyft rides in Boston. Features automated data wrangling, hypothesis testing on pricing trends, and visualization of weather/distance correlations using Python, Pandas, and Seaborn.

Notifications You must be signed in to change notification settings

ChaseH01/ExploratoryDataAnalysis_Lyft-UberData

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

Exploring Uber and Lyft Prices in Boston

Overview

This project presents an exploratory data analysis (EDA) of a dataset containing Uber and Lyft ride information and pricing in Boston, MA. The goal was to uncover insights into ride characteristics, pricing trends, and the influence of factors like time, distance, and weather. Through this analysis, we aimed to understand patterns within the rideshare data, demonstrating a practical approach to extracting meaningful information from real-world datasets. Note: This Overview section was created with AI.

Contribution

This version represents my personal adaptation of a project originally developed in collaboration with Matteo Dall'Olmo. It features my own refinements and revised documentation for clarity and portfolio purposes.

Methodology

The analysis followed a structured EDA process, which included:

  1. Data Acquisition and Basic Understanding:
    Loading the dataset and getting an initial feel for its structure and content.

  2. Data Wrangling:
    Cleaning and preparing the data for analysis.

  3. Data Profiling:
    Understanding the distribution and characteristics of individual variables.

  4. Hypothesis Development:
    Formulating questions to investigate based on initial observations.

  5. Hypothesis Investigation:
    Using data to test developed hypotheses.

  6. Results Summary:
    Summarizing findings and answering the initial questions.

  7. Critical Review:
    Reviewing the workflow and discussing ethical considerations.

About

Statistica exploratory data analysis (EDA) of 600K+ Uber/Lyft rides in Boston. Features automated data wrangling, hypothesis testing on pricing trends, and visualization of weather/distance correlations using Python, Pandas, and Seaborn.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors