Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | more arxiv polish | Bryan Newbold | 2019-05-21 | 1 | -22/+29 |
* | yet another JALC edge-case | Bryan Newbold | 2019-05-21 | 1 | -1/+1 |
* | arxiv importer robustification and CLI impl | Bryan Newbold | 2019-05-21 | 1 | -9/+20 |
* | better JALC DOI de-mangling | Bryan Newbold | 2019-05-21 | 1 | -1/+10 |
* | JALC importer requires a valid DOI | Bryan Newbold | 2019-05-21 | 1 | -0/+1 |
* | handle bad JALC DOIs | Bryan Newbold | 2019-05-21 | 1 | -1/+3 |
* | JALC more robust to partial names | Bryan Newbold | 2019-05-21 | 1 | -8/+19 |
* | more JALC importer tweaks | Bryan Newbold | 2019-05-21 | 1 | -7/+10 |
* | JALC importer: handle missing titles | Bryan Newbold | 2019-05-21 | 1 | -0/+2 |
* | importers: create containers by default | Bryan Newbold | 2019-05-21 | 4 | -4/+8 |
* | more JALC importer polish | Bryan Newbold | 2019-05-21 | 1 | -4/+17 |
* | JALC bulk file importer | Bryan Newbold | 2019-05-21 | 2 | -1/+21 |
* | correct JSTOR fix | Bryan Newbold | 2019-05-21 | 1 | -6/+6 |
* | fix lint errors in JSTOR importer | Bryan Newbold | 2019-05-21 | 1 | -17/+16 |
* | arxiv importer polish | Bryan Newbold | 2019-05-21 | 1 | -3/+4 |
* | JSTOR importer polish | Bryan Newbold | 2019-05-21 | 1 | -14/+38 |
* | updates to pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -33/+80 |
* | fix lint issue in pubmed importer | Bryan Newbold | 2019-05-21 | 1 | -1/+1 |
* | tweaks to new imports/tests | Bryan Newbold | 2019-05-21 | 5 | -28/+94 |
* | initial pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -2/+515 |
* | arxiv license/slug map | Bryan Newbold | 2019-05-21 | 1 | -0/+1 |
* | missing jstor import test (and fix typo) | Bryan Newbold | 2019-05-21 | 1 | -2/+1 |
* | initial arxivraw importer (from parser) | Bryan Newbold | 2019-05-21 | 2 | -0/+299 |
* | clean up JALC importer a tiny bit | Bryan Newbold | 2019-05-21 | 1 | -8/+3 |
* | initial JSTOR importer | Bryan Newbold | 2019-05-21 | 2 | -0/+271 |
* | initial flesh out of JALC parser | Bryan Newbold | 2019-05-21 | 3 | -1/+348 |
* | include creator_ids in release elastic schema | Bryan Newbold | 2019-05-20 | 1 | -0/+6 |
* | include structured contrib names in CDL/dash importer | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
* | elastic release schema update | Bryan Newbold | 2019-05-20 | 1 | -2/+5 |
* | improved CSL transform (structured author names) | Bryan Newbold | 2019-05-20 | 1 | -12/+11 |
* | make some XXX into TODO | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
* | fix elastic file pdf check | Bryan Newbold | 2019-05-16 | 1 | -1/+3 |
* | elastic transforms: work around missing pdf mimetypes | Bryan Newbold | 2019-05-15 | 1 | -1/+1 |
* | fix default mimetype (impacted pre-1923 files) | Bryan Newbold | 2019-05-15 | 2 | -4/+9 |
* | python impl | Bryan Newbold | 2019-05-14 | 9 | -32/+38 |
* | python impl | Bryan Newbold | 2019-05-14 | 6 | -16/+16 |
* | python: impl size_bytes -> size | Bryan Newbold | 2019-05-13 | 1 | -1/+1 |
* | importer code updates | Bryan Newbold | 2019-05-13 | 4 | -3/+18 |
* | partial python impl of ext_id and release_stage refactors | Bryan Newbold | 2019-05-13 | 5 | -29/+35 |
* | handle null abstracts for release | Bryan Newbold | 2019-05-07 | 1 | -1/+1 |
* | add limits to match importers | Bryan Newbold | 2019-04-23 | 3 | -2/+27 |
* | archive.org isn't really a repository | Bryan Newbold | 2019-04-22 | 1 | -1/+3 |
* | editgroup description override | Bryan Newbold | 2019-04-22 | 1 | -2/+2 |
* | arabesque importer does require timestamp/wayback | Bryan Newbold | 2019-04-22 | 1 | -0/+3 |
* | matched importer shouldn't require wayback | Bryan Newbold | 2019-04-22 | 1 | -5/+7 |
* | handle API 400 in arabesque import (invalid extid) | Bryan Newbold | 2019-04-19 | 1 | -7/+14 |
* | fix arabesque importer crawl_id None bug | Bryan Newbold | 2019-04-18 | 1 | -1/+1 |
* | mechanism to not double-update entities | Bryan Newbold | 2019-04-18 | 2 | -1/+9 |
* | minor arabesque tweaks | Bryan Newbold | 2019-04-18 | 1 | -0/+2 |
* | update URL rel list | Bryan Newbold | 2019-04-18 | 1 | -1/+10 |