Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | SQUASH: arxiv importer syntax | Bryan Newbold | 2019-05-22 | 1 | -1/+1 |
| | |||||
* | stderr.write() needs newline | Bryan Newbold | 2019-05-22 | 1 | -1/+1 |
| | |||||
* | better JALC and arxiv DOI checks | Bryan Newbold | 2019-05-22 | 2 | -2/+4 |
| | |||||
* | more arxiv polish | Bryan Newbold | 2019-05-21 | 1 | -22/+29 |
| | |||||
* | yet another JALC edge-case | Bryan Newbold | 2019-05-21 | 1 | -1/+1 |
| | |||||
* | arxiv importer robustification and CLI impl | Bryan Newbold | 2019-05-21 | 1 | -9/+20 |
| | |||||
* | better JALC DOI de-mangling | Bryan Newbold | 2019-05-21 | 1 | -1/+10 |
| | |||||
* | JALC importer requires a valid DOI | Bryan Newbold | 2019-05-21 | 1 | -0/+1 |
| | |||||
* | handle bad JALC DOIs | Bryan Newbold | 2019-05-21 | 1 | -1/+3 |
| | |||||
* | JALC more robust to partial names | Bryan Newbold | 2019-05-21 | 1 | -8/+19 |
| | |||||
* | more JALC importer tweaks | Bryan Newbold | 2019-05-21 | 1 | -7/+10 |
| | |||||
* | JALC importer: handle missing titles | Bryan Newbold | 2019-05-21 | 1 | -0/+2 |
| | |||||
* | importers: create containers by default | Bryan Newbold | 2019-05-21 | 4 | -4/+8 |
| | |||||
* | more JALC importer polish | Bryan Newbold | 2019-05-21 | 1 | -4/+17 |
| | |||||
* | JALC bulk file importer | Bryan Newbold | 2019-05-21 | 2 | -1/+21 |
| | |||||
* | correct JSTOR fix | Bryan Newbold | 2019-05-21 | 1 | -6/+6 |
| | |||||
* | fix lint errors in JSTOR importer | Bryan Newbold | 2019-05-21 | 1 | -17/+16 |
| | |||||
* | arxiv importer polish | Bryan Newbold | 2019-05-21 | 1 | -3/+4 |
| | |||||
* | JSTOR importer polish | Bryan Newbold | 2019-05-21 | 1 | -14/+38 |
| | |||||
* | updates to pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -33/+80 |
| | |||||
* | fix lint issue in pubmed importer | Bryan Newbold | 2019-05-21 | 1 | -1/+1 |
| | |||||
* | tweaks to new imports/tests | Bryan Newbold | 2019-05-21 | 5 | -28/+94 |
| | |||||
* | initial pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -2/+515 |
| | |||||
* | arxiv license/slug map | Bryan Newbold | 2019-05-21 | 1 | -0/+1 |
| | |||||
* | missing jstor import test (and fix typo) | Bryan Newbold | 2019-05-21 | 1 | -2/+1 |
| | |||||
* | initial arxivraw importer (from parser) | Bryan Newbold | 2019-05-21 | 2 | -0/+299 |
| | |||||
* | clean up JALC importer a tiny bit | Bryan Newbold | 2019-05-21 | 1 | -8/+3 |
| | |||||
* | initial JSTOR importer | Bryan Newbold | 2019-05-21 | 2 | -0/+271 |
| | |||||
* | initial flesh out of JALC parser | Bryan Newbold | 2019-05-21 | 3 | -1/+348 |
| | |||||
* | include creator_ids in release elastic schema | Bryan Newbold | 2019-05-20 | 1 | -0/+6 |
| | | | | Intent is to allow fast creator search/lookup | ||||
* | include structured contrib names in CDL/dash importer | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
| | |||||
* | elastic release schema update | Bryan Newbold | 2019-05-20 | 1 | -2/+5 |
| | |||||
* | improved CSL transform (structured author names) | Bryan Newbold | 2019-05-20 | 1 | -12/+11 |
| | |||||
* | make some XXX into TODO | Bryan Newbold | 2019-05-20 | 1 | -2/+2 |
| | |||||
* | fix elastic file pdf check | Bryan Newbold | 2019-05-16 | 1 | -1/+3 |
| | |||||
* | elastic transforms: work around missing pdf mimetypes | Bryan Newbold | 2019-05-15 | 1 | -1/+1 |
| | |||||
* | fix default mimetype (impacted pre-1923 files) | Bryan Newbold | 2019-05-15 | 2 | -4/+9 |
| | |||||
* | python impl | Bryan Newbold | 2019-05-14 | 9 | -32/+38 |
| | |||||
* | python impl | Bryan Newbold | 2019-05-14 | 6 | -16/+16 |
| | |||||
* | python: impl size_bytes -> size | Bryan Newbold | 2019-05-13 | 1 | -1/+1 |
| | |||||
* | importer code updates | Bryan Newbold | 2019-05-13 | 4 | -3/+18 |
| | |||||
* | partial python impl of ext_id and release_stage refactors | Bryan Newbold | 2019-05-13 | 5 | -29/+35 |
| | |||||
* | handle null abstracts for release | Bryan Newbold | 2019-05-07 | 1 | -1/+1 |
| | |||||
* | add limits to match importers | Bryan Newbold | 2019-04-23 | 3 | -2/+27 |
| | |||||
* | archive.org isn't really a repository | Bryan Newbold | 2019-04-22 | 1 | -1/+3 |
| | |||||
* | editgroup description override | Bryan Newbold | 2019-04-22 | 1 | -2/+2 |
| | |||||
* | arabesque importer does require timestamp/wayback | Bryan Newbold | 2019-04-22 | 1 | -0/+3 |
| | |||||
* | matched importer shouldn't require wayback | Bryan Newbold | 2019-04-22 | 1 | -5/+7 |
| | |||||
* | handle API 400 in arabesque import (invalid extid) | Bryan Newbold | 2019-04-19 | 1 | -7/+14 |
| | |||||
* | fix arabesque importer crawl_id None bug | Bryan Newbold | 2019-04-18 | 1 | -1/+1 |
| |