This repository has been archived on 2025-01-04. You can view files and clone it, but cannot push or open issues or pull requests.
Sebastian Lenzlinger aeeb2e368d Merge branch 'sebl' into 'main'
Add code to create Artifacts for Data Analysis and Presentation.

See merge request dbis/lecture-groups/database-systems/2023hs/group-1!5
2024-01-06 15:57:11 +00:00
2024-01-05 13:52:23 +01:00
2024-01-06 16:39:58 +01:00
2024-01-05 19:09:47 +01:00
2024-01-04 15:47:54 +01:00

Database Project Group 1

Preliminaries

  • Ensure you have access to a running postgres instance
  • Ensure you have python3 and pip installed.
  • From within the root of the project, run pip install -r requirements.txt. This insures all python dependecies are met.
  • In src/fill_db.py look for the db_info variable and adapt it to your credentials.

Action

In the following the order matters.

  1. Run ensure_dirs_exist.py. This makes sure all the directories needed to perform the data integration and logging exist.
  2. Run integrate.py. Adjust the main method to fit your needs. In particular adjust the process_all_data() method, such that the parameter corresponding to a dataset is False if the script shall download it form the internet, and True else. To get geojson data form signaled speed in to city of Zurich uncomment the line in the main method where you find load_tempo_geojson_from_api_to_local()
  3. Run fill_db.py. This will load the data into the database based on the credentials given in the db_info variable.
  4. Perform Analysis.
Description
Languages
Jupyter Notebook 92.1%
Python 7.8%