Project Structure

├── data_clean
├── data_raw
├── documentation
├── results
│   ├── Plots
├── scripts
└── sql

Data Extraction

All the data were downloaded from respective sources manually, by selecting indicators, fields or specific files. The downloaded raw data were stored in data_raw directory. The data ranges from minimum 1980 to maximum 2024.

Cleaning Data

The raw data were cleaned using python scripts in scripts directory and saved in data_clean directory.

SQL

Postgresql was used for creating the database of the cleaned data along with the schema, doing test analysis, saving data tables and exporting shaped tables for visualization. In the sql directory files –

  • schema.sql defined tables dim_time, dim_indicator, dim_source, fact_macro, fact_banking. The latter 2 were the tables containing the suggestive indicator values while the prior ones were purely for mapping in between the latter without any ambiguity.
  • load_macro.sql, load_fsi.sql, load_wb.sql were suggestively to load the cleaned data from data_clean directory.
  • views.sql defined tables joining the original tables to make the analysis easier.

The analytical queries were done in the interactive database sessions in local system terminal and the necessary ones were saved as csv extracts for visualization in the results directory.

Results

The following results were used as inputs for the analysis presented in subsequent sections.

Macroeconomic Results

The macroeconomic indicator sub-tables as results are:

Banking Results

The banking sector indicator sub-tables as results are:

Cross-linked Results

The results that are indicative of the cross-sector outcomes: