Dask-based data ingester + explorer
- Load any dataset quickly into sqllite (in parallel wherever possible)
- Automatically infer schema
- Generate django model classes to be able to explore the dataset via the ORM
- Generate django admin list views based on configuration (with support for filters)
- Any dataset supported by dask
- django-rest-framework API support
- Generic HTTP API support
- New ones may be added by subclassing
aragog.fetchers.base.Fetcherwhich uses the pythoniteratorprotocol
idis a reserved column name and if a dataset has a column by that name, it will have to be skipped (checkpackages.yml.examplefor example)- All datasets are loaded on server start. This is done to be able to infer the schema and generate the model classes. This should instead be calculated once and persisted to the DB. Generating model classes can then by done by reading this persisted schema
- Clone this repo
- Rename
packages.yml.exampletopackages.yml(or configure an appropriate location insettings.PACKAGES_FILE $ ./manage.py runserver