JUHE API Marketplace
DATASET
Open Source Community

Houses.csv

Dataset for a machine learning project on Polish house price prediction, containing detailed information such as location, size, and floor.

Updated 12/18/2023
github

Description

Polish House Price Prediction Dataset Overview

Dataset Structure

  • data/: Contains the raw dataset Houses.csv and preprocessed data files X_train.csv, X_test.csv, y_train.csv, y_test.csv.
  • models/: Stores trained models, including linear_regression_model.pkl and knn_model.pkl.
  • src/: Contains source code for data preprocessing, model training, and evaluation, such as preprocessing.py, linear_regression.py, knn.py, main.py.
  • notebooks/: Contains Jupyter notebooks for exploratory data analysis and model building, EDA.ipynb and Modeling.ipynb.

Model Information

  1. Linear Regression Model:

    • Trained using scikit‑learn's LinearRegression.
    • Model saved as models/linear_regression_model.pkl.
    • Evaluation metrics include mean squared error, R² score, and cross‑validation score.
  2. K‑Nearest Neighbors (KNN) Model:

    • Trained using scikit‑learn's KNeighborsRegressor.
    • Model saved as models/knn_model.pkl.
    • Evaluation metrics include mean squared error, R² score, and cross‑validation score.

Future Improvement Directions

  • Hyperparameter tuning: Try different configurations to improve model performance.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Real Estate Analysis
Machine Learning

Source

Organization: github

Created: 12/9/2023

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.