......@@ -32,7 +32,12 @@ This is our simplified tree of directories and files
│   │   │      ├── reviews.csv
│   │   │      └── reviews.csv.gz
│   │   ├── contornos shapes
│   │   └── demografia-vivienda
│   │   │   ├── barrios_geo.json
│   │   │   └── distritos_geo.json
│   │   ├── demografia-vivienda
│   │   │   └── poblacion.barrio.sexo.edad.2019.csv
│   │   └── socio-economico
│   │      └── renta.distrito.barrio.2015.2016.csv
│   └── output processed data
│   └── airbnb
├── images output images
......@@ -49,7 +54,18 @@ This is our simplified tree of directories and files
├── taller
│   └── team1 files of one of the workshop groups
```
## How to run Inside Airbnb scraper
## Data sources
+ Airbnb datasets come from [Inside Airbnb](http://insideairbnb.com).
+ Neighbourhoods geojson files come from [Inside Airbnb](http://insideairbnb.com).
+ Districts geojson files come from [DataHippo](https://datahippo.org).
+ Population data comes from [Padrón Sevilla](https://www.sevilla.org/servicios/servicio-de-estadistica/datos-estadisticos/explotacion-estadistica-padron).
+ Income data comes from [INE](https://www.ine.es/jaxiT3/Tabla.htm?t=31213).
## Scraping
### How to run Inside Airbnb scraper
The Inside Airbnb scraper is a Python script that searches the insideairbnb.com site for all files available for a territory and downloads them all.
The script is located in the `scraping/` folder and is called `ia.dataset.download.get.urls.py`.
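
As a rough illustration of the idea described above (collect the links for one territory, then download each file), here is a minimal sketch that uses only the Python standard library. It is not the code of `ia.dataset.download.get.urls.py`: the names `LinkCollector`, `find_dataset_urls` and `download_all`, the list of file extensions, and the rule of matching the territory name inside the URL are all assumptions made for this example.

```python
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag found in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def find_dataset_urls(html, territory):
    """Return links to data files whose URL mentions the territory.

    Assumption: the territory name appears in the URL and data files
    end in one of the extensions below.
    """
    parser = LinkCollector()
    parser.feed(html)
    extensions = (".csv", ".csv.gz", ".geojson")
    return [
        url for url in parser.links
        if territory.lower() in url.lower()
        and url.lower().endswith(extensions)
    ]


def download_all(urls, target_dir):
    """Download each URL, mirroring its URL path under target_dir.

    Mirroring the path keeps files with the same name (for example,
    from different scrape dates) from overwriting each other.
    """
    for url in urls:
        local_path = Path(target_dir) / urlparse(url).path.lstrip("/")
        local_path.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(url, str(local_path))
```

In this sketch the page HTML would first be fetched (for instance with `urllib.request.urlopen`), passed to `find_dataset_urls` together with a territory name such as `"sevilla"`, and the resulting list handed to `download_all` along with an output folder.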
......
../airbnb/190930/neighbourhoods.geojson
\ No newline at end of file
This source diff could not be displayed because it is too large. You can view the blob instead.