Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | no-derive metadata and SQL dump uploads (to petabox) | Bryan Newbold | 2019-04-30 | 1 | -0/+2 |
| | |||||
* | faster elasticsearch imports | Bryan Newbold | 2019-04-30 | 1 | -1/+1 |
| | |||||
* | more bots to bootstrap | Bryan Newbold | 2019-04-24 | 1 | -0/+15 |
| | |||||
* | update sql dump README | Bryan Newbold | 2019-04-24 | 1 | -9/+12 |
| | |||||
* | fix wild elastic schema typo | Bryan Newbold | 2019-04-12 | 1 | -1/+1 |
| | |||||
* | record webcaptures added as demos | Bryan Newbold | 2019-03-19 | 1 | -0/+45 |
| | |||||
* | new importer: wayback_static | Bryan Newbold | 2019-03-19 | 1 | -203/+0 |
| | |||||
* | update enrich examples demo script | Bryan Newbold | 2019-03-19 | 1 | -49/+63 |
| | |||||
* | initial wayback-to-webcapture helper | Bryan Newbold | 2019-03-19 | 1 | -0/+203 |
| | |||||
* | more integration of transform refactor | Bryan Newbold | 2019-03-11 | 1 | -2/+2 |
| | |||||
* | elastic schema indentation | Bryan Newbold | 2019-03-06 | 1 | -6/+6 |
| | |||||
* | gitignore SQL identifier dumps | Bryan Newbold | 2019-02-22 | 1 | -0/+1 |
| | |||||
* | include container_id in release ES schema | Bryan Newbold | 2019-02-22 | 1 | -0/+1 |
| | |||||
* | update ISSN-L file | Bryan Newbold | 2019-02-20 | 2 | -2/+6 |
| | |||||
* | robust-ify bootstrap bots script | Bryan Newbold | 2019-02-05 | 1 | -0/+7 |
| | |||||
* | start of README files for item uploads | Bryan Newbold | 2019-02-05 | 3 | -0/+26 |
| | |||||
* | use pigz over gzip in more places | Bryan Newbold | 2019-02-05 | 2 | -7/+15 |
| | |||||
* | update dump and sort commands | Bryan Newbold | 2019-02-01 | 2 | -7/+17 |
| | | | | | Pipeline sorts are *so* starved and slow ; they only get a few MByte of RAM by default! | ||||
* | update to newer ISSN-L mapping | Bryan Newbold | 2019-01-29 | 2 | -2/+2 |
| | |||||
* | helper to delete 'builtin' example entities | Bryan Newbold | 2019-01-29 | 1 | -0/+73 |
| | | | | Idea is to clear these before "real" metadata import. | ||||
* | minor typo in esbulk container import | Bryan Newbold | 2019-01-28 | 1 | -1/+1 |
| | |||||
* | more ES index name updates | Bryan Newbold | 2019-01-28 | 1 | -2/+3 |
| | |||||
* | add filesets and webcaptures to dumps | Bryan Newbold | 2019-01-28 | 4 | -1/+33 |
| | |||||
* | transform and import fixes/tweaks | Bryan Newbold | 2019-01-25 | 3 | -8/+122 |
| | |||||
* | improved journal metadata munger | Bryan Newbold | 2019-01-25 | 2 | -100/+325 |
| | |||||
* | tweak elastic schemas (again) | Bryan Newbold | 2019-01-25 | 2 | -6/+4 |
| | |||||
* | first-pass journal metadata munger | Bryan Newbold | 2019-01-24 | 5 | -0/+512 |
| | |||||
* | initial changelog and container ES schemas | Bryan Newbold | 2019-01-23 | 2 | -0/+113 |
| | |||||
* | start changes to release ES schema | Bryan Newbold | 2019-01-23 | 1 | -22/+39 |
| | |||||
* | add helper/hack script to generate bots | Bryan Newbold | 2019-01-22 | 1 | -0/+25 |
| | |||||
* | state in elasticsearch (and deleted/redirects) | Bryan Newbold | 2019-01-18 | 1 | -0/+1 |
| | |||||
* | local collectd example | Bryan Newbold | 2019-01-10 | 1 | -0/+22 |
| | |||||
* | remove redundant transform_release.py ES script | Bryan Newbold | 2018-12-24 | 2 | -88/+1 |
| | |||||
* | implement release_year (and rustfmt) | Bryan Newbold | 2018-12-24 | 1 | -0/+2 |
| | |||||
* | fixes to demo enrich script | Bryan Newbold | 2018-11-21 | 1 | -12/+20 |
| | |||||
* | progress on auto demo creation | Bryan Newbold | 2018-11-21 | 8 | -262/+166 |
| | |||||
* | start work on demo works | Bryan Newbold | 2018-11-21 | 2 | -0/+275 |
| | |||||
* | updated docker for elastic (with plugin) | Bryan Newbold | 2018-11-07 | 5 | -47/+11 |
| | | | | Still need to install the maps (aka, schemas) manually. | ||||
* | note elastic plugin needed | Bryan Newbold | 2018-11-07 | 2 | -0/+52 |
| | |||||
* | first draft docker-compose file and README | Bryan Newbold | 2018-11-07 | 2 | -0/+50 |
| | |||||
* | for now, store is_longtail_oa in container_is_longtail_oa | Bryan Newbold | 2018-10-12 | 1 | -0/+2 |
| | |||||
* | document need to LC_ALL=C.UTF-8 for ES import | Bryan Newbold | 2018-09-28 | 1 | -1/+2 |
| | |||||
* | fix typo in elastic load script | Bryan Newbold | 2018-09-26 | 1 | -1/+1 |
| | |||||
* | ignore more files | Bryan Newbold | 2018-09-25 | 1 | -0/+1 |
| | |||||
* | better default file names | Bryan Newbold | 2018-09-25 | 1 | -7/+7 |
| | |||||
* | fix typos in es/transform script | Bryan Newbold | 2018-09-25 | 1 | -3/+3 |
| | |||||
* | script for partitioning dumps (needs test) | Bryan Newbold | 2018-09-24 | 3 | -0/+68 |
| | |||||
* | fix typo/error in elastic schema | Bryan Newbold | 2018-09-24 | 1 | -1/+0 |
| | |||||
* | fix NoneType python error in transform_release | Bryan Newbold | 2018-09-24 | 1 | -1/+1 |
| | |||||
* | gitignore for elastic files | Bryan Newbold | 2018-09-22 | 1 | -0/+2 |
| |