DATASET
Open Source Community
Houses.csv
Dataset for a machine learning project on Polish house price prediction, containing detailed information such as location, size, and floor.
Updated 12/18/2023
github
Description
Polish House Price Prediction Dataset Overview
Dataset Structure
- data/: Contains the raw dataset
Houses.csvand preprocessed data filesX_train.csv,X_test.csv,y_train.csv,y_test.csv. - models/: Stores trained models, including
linear_regression_model.pklandknn_model.pkl. - src/: Contains source code for data preprocessing, model training, and evaluation, such as
preprocessing.py,linear_regression.py,knn.py,main.py. - notebooks/: Contains Jupyter notebooks for exploratory data analysis and model building,
EDA.ipynbandModeling.ipynb.
Model Information
-
Linear Regression Model:
- Trained using scikit‑learn's
LinearRegression. - Model saved as
models/linear_regression_model.pkl. - Evaluation metrics include mean squared error, R² score, and cross‑validation score.
- Trained using scikit‑learn's
-
K‑Nearest Neighbors (KNN) Model:
- Trained using scikit‑learn's
KNeighborsRegressor. - Model saved as
models/knn_model.pkl. - Evaluation metrics include mean squared error, R² score, and cross‑validation score.
- Trained using scikit‑learn's
Future Improvement Directions
- Hyperparameter tuning: Try different configurations to improve model performance.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Real Estate Analysis
Machine Learning
Source
Organization: github
Created: 12/9/2023
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.