Automatidata-ML

Project: New York City Taxi Fare Prediction

The New York City Taxi and Limousine Commission seeks a way to utilize the data collected from the New York City area to predict the fare amount for taxi cab rides. The project comprises five notebooks, each focusing on a different aspect of the data analysis process.

Each notebook plays a crucial role in understanding the data, conducting analysis, building models, and providing insights regarding fare prediction and customer tipping behavior in New York City taxi cab rides.

Notebook 1: Data Inspection and Analysis

Data Overview

Data Overview

Data Quality Issues

Data Quality Issues

Insights

Notebook 2: Exploratory Data Analysis (EDA)

Trip Characteristics Analysis

Trip Characteristics Analysis

Insights1 Insights2

Passenger Count and Ride Frequency

Passenger Count1 Passenger Count2

Trip Characteristics Analysis

Temporal Analysis

Daily and Monthly Ride Patterns

Seasonal Trends

Geographic Analysis

Location Density1 Location Density2

Limitations and Further Investigation

This exploratory analysis provides a comprehensive understanding of the characteristics and patterns within the taxi ride dataset, laying the groundwork for further analysis and modeling in subsequent notebooks.

Notebook 3: Hypothesis testing

Hypothesis and Test Results

Hypothesis and Test Results

Business Insight

Realism of the A/B Project

This analysis provides valuable insights into how payment method can impact taxi drivers’ revenues, but also highlights the limitations and necessary assumptions in conducting A/B testing projects in a real-world setting.

Notebook 4: Multiple Linear Regression Model

Outlier Analysis and Data Preprocessing

Outliers Detection:

Outliers Detection

Handling Outliers:

Feature Engineering and Selection

New Feature Creation:

Correlation Analysis:

Correlation Analysis

Model Building and Evaluation

Model Performance:

Coefficients Interpretation:

Coefficients Analysis

Key Takeaways

Notebook 5: Ethical Considerations and Model Evaluation

Ethical Implications

Modification to Modeling Objective

Model Evaluation

Model Evaluation

Recommendations

Recommendations

Conclusion