| | Commit message | Author | Date | Files | Lines |
---|---|---|---|---|---|
* | JALC importer: handle missing titles | Bryan Newbold | 2019-05-21 | 1 | -0/+2 |
* | importers: create containers by default | Bryan Newbold | 2019-05-21 | 4 | -4/+8 |
* | more JALC importer polish | Bryan Newbold | 2019-05-21 | 1 | -4/+17 |
* | JALC bulk file importer | Bryan Newbold | 2019-05-21 | 2 | -1/+21 |
* | correct JSTOR fix | Bryan Newbold | 2019-05-21 | 1 | -6/+6 |
* | fix lint errors in JSTOR importer | Bryan Newbold | 2019-05-21 | 1 | -17/+16 |
* | arxiv importer polish | Bryan Newbold | 2019-05-21 | 1 | -3/+4 |
* | JSTOR importer polish | Bryan Newbold | 2019-05-21 | 1 | -14/+38 |
* | updates to pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -33/+80 |
* | fix lint issue in pubmed importer | Bryan Newbold | 2019-05-21 | 1 | -1/+1 |
* | tweaks to new imports/tests | Bryan Newbold | 2019-05-21 | 5 | -28/+94 |
* | initial pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -2/+515 |
* | arxiv license/slug map | Bryan Newbold | 2019-05-21 | 1 | -0/+1 |
* | missing jstor import test (and fix typo) | Bryan Newbold | 2019-05-21 | 1 | -2/+1 |
* | initial arxivraw importer (from parser) | Bryan Newbold | 2019-05-21 | 2 | -0/+299 |
* | clean up JALC importer a tiny bit | Bryan Newbold | 2019-05-21 | 1 | -8/+3 |
* | initial JSTOR importer | Bryan Newbold | 2019-05-21 | 2 | -0/+271 |
* | initial flesh out of JALC parser | Bryan Newbold | 2019-05-21 | 3 | -1/+348 |
* | include creator_ids in release elastic schema | Bryan Newbold | 2019-05-20 | 1 | -0/+6 |
* | include structured contrib names in CDL/dash importer | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
* | elastic release schema update | Bryan Newbold | 2019-05-20 | 1 | -2/+5 |
* | improved CSL transform (structured author names) | Bryan Newbold | 2019-05-20 | 1 | -12/+11 |
* | make some XXX into TODO | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
* | fix elastic file pdf check | Bryan Newbold | 2019-05-16 | 1 | -1/+3 |
* | elastic transforms: work around missing pdf mimetypes | Bryan Newbold | 2019-05-15 | 1 | -1/+1 |
* | fix default mimetype (impacted pre-1923 files) | Bryan Newbold | 2019-05-15 | 2 | -4/+9 |
* | python impl | Bryan Newbold | 2019-05-14 | 9 | -32/+38 |
* | python impl | Bryan Newbold | 2019-05-14 | 6 | -16/+16 |
* | python: impl size_bytes -> size | Bryan Newbold | 2019-05-13 | 1 | -1/+1 |
* | importer code updates | Bryan Newbold | 2019-05-13 | 4 | -3/+18 |
* | partial python impl of ext_id and release_stage refactors | Bryan Newbold | 2019-05-13 | 5 | -29/+35 |
* | handle null abstracts for release | Bryan Newbold | 2019-05-07 | 1 | -1/+1 |
* | add limits to match importers | Bryan Newbold | 2019-04-23 | 3 | -2/+27 |
* | archive.org isn't really a repository | Bryan Newbold | 2019-04-22 | 1 | -1/+3 |
* | editgroup description override | Bryan Newbold | 2019-04-22 | 1 | -2/+2 |
* | arabesque importer does require timestamp/wayback | Bryan Newbold | 2019-04-22 | 1 | -0/+3 |
* | matched importer shouldn't require wayback | Bryan Newbold | 2019-04-22 | 1 | -5/+7 |
* | handle API 400 in arabesque import (invalid extid) | Bryan Newbold | 2019-04-19 | 1 | -7/+14 |
* | fix arabesque importer crawl_id None bug | Bryan Newbold | 2019-04-18 | 1 | -1/+1 |
* | mechanism to not double-update entities | Bryan Newbold | 2019-04-18 | 2 | -1/+9 |
* | minor arabesque tweaks | Bryan Newbold | 2019-04-18 | 1 | -0/+2 |
* | update URL rel list | Bryan Newbold | 2019-04-18 | 1 | -1/+10 |
* | arabesque importer does fewer updates | Bryan Newbold | 2019-04-18 | 1 | -1/+8 |
* | arabesque importer | Bryan Newbold | 2019-04-18 | 1 | -0/+165 |
* | early version of arabesque importer | Bryan Newbold | 2019-04-12 | 1 | -0/+1 |
* | add SqlitePusher importer option | Bryan Newbold | 2019-04-12 | 2 | -1/+21 |
* | fix reviewer bugs (thanks pylint) | Bryan Newbold | 2019-04-06 | 1 | -3/+3 |
* | basic dummy review bot | Bryan Newbold | 2019-04-06 | 2 | -0/+239 |
* | improve test coverage | Bryan Newbold | 2019-04-04 | 1 | -0/+1 |
* | increase default harvest window to 14 days | Bryan Newbold | 2019-04-01 | 1 | -2/+2 |