Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | pubmed importer command and tweaks | Bryan Newbold | 2019-05-22 | 1 | -9/+227 | |
| | ||||||
* | bs4 XML parse cleanup | Bryan Newbold | 2019-05-22 | 1 | -0/+2 | |
| | ||||||
* | arxiv license slug shorter; fix test | Bryan Newbold | 2019-05-22 | 1 | -1/+1 | |
| | ||||||
* | SQUASH: arxiv importer syntax | Bryan Newbold | 2019-05-22 | 1 | -1/+1 | |
| | ||||||
* | stderr.write() needs newline | Bryan Newbold | 2019-05-22 | 1 | -1/+1 | |
| | ||||||
* | better JALC and arxiv DOI checks | Bryan Newbold | 2019-05-22 | 2 | -2/+4 | |
| | ||||||
* | more arxiv polish | Bryan Newbold | 2019-05-21 | 1 | -22/+29 | |
| | ||||||
* | yet another JALC edge-case | Bryan Newbold | 2019-05-21 | 1 | -1/+1 | |
| | ||||||
* | arxiv importer robustification and CLI impl | Bryan Newbold | 2019-05-21 | 1 | -9/+20 | |
| | ||||||
* | better JALC DOI de-mangling | Bryan Newbold | 2019-05-21 | 1 | -1/+10 | |
| | ||||||
* | JALC importer requires a valid DOI | Bryan Newbold | 2019-05-21 | 1 | -0/+1 | |
| | ||||||
* | handle bad JALC DOIs | Bryan Newbold | 2019-05-21 | 1 | -1/+3 | |
| | ||||||
* | JALC more robust to partial names | Bryan Newbold | 2019-05-21 | 1 | -8/+19 | |
| | ||||||
* | more JALC importer tweaks | Bryan Newbold | 2019-05-21 | 1 | -7/+10 | |
| | ||||||
* | JALC importer: handle missing titles | Bryan Newbold | 2019-05-21 | 1 | -0/+2 | |
| | ||||||
* | importers: create containers by default | Bryan Newbold | 2019-05-21 | 4 | -4/+8 | |
| | ||||||
* | more JALC importer polish | Bryan Newbold | 2019-05-21 | 1 | -4/+17 | |
| | ||||||
* | JALC bulk file importer | Bryan Newbold | 2019-05-21 | 2 | -1/+21 | |
| | ||||||
* | correct JSTOR fix | Bryan Newbold | 2019-05-21 | 1 | -6/+6 | |
| | ||||||
* | fix lint errors in JSTOR importer | Bryan Newbold | 2019-05-21 | 1 | -17/+16 | |
| | ||||||
* | arxiv importer polish | Bryan Newbold | 2019-05-21 | 1 | -3/+4 | |
| | ||||||
* | JSTOR importer polish | Bryan Newbold | 2019-05-21 | 1 | -14/+38 | |
| | ||||||
* | updates to pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -33/+80 | |
| | ||||||
* | fix lint issue in pubmed importer | Bryan Newbold | 2019-05-21 | 1 | -1/+1 | |
| | ||||||
* | tweaks to new imports/tests | Bryan Newbold | 2019-05-21 | 5 | -28/+94 | |
| | ||||||
* | initial pubmed importer | Bryan Newbold | 2019-05-21 | 2 | -2/+515 | |
| | ||||||
* | arxiv license/slug map | Bryan Newbold | 2019-05-21 | 1 | -0/+1 | |
| | ||||||
* | missing jstor import test (and fix typo) | Bryan Newbold | 2019-05-21 | 1 | -2/+1 | |
| | ||||||
* | initial arxivraw importer (from parser) | Bryan Newbold | 2019-05-21 | 2 | -0/+299 | |
| | ||||||
* | clean up JALC importer a tiny bit | Bryan Newbold | 2019-05-21 | 1 | -8/+3 | |
| | ||||||
* | initial JSTOR importer | Bryan Newbold | 2019-05-21 | 2 | -0/+271 | |
| | ||||||
* | initial flesh out of JALC parser | Bryan Newbold | 2019-05-21 | 3 | -1/+348 | |
| | ||||||
* | include structured contrib names in CDL/dash importer | Bryan Newbold | 2019-05-20 | 1 | -2/+2 | |
| | ||||||
* | fix default mimetype (impacted pre-1923 files) | Bryan Newbold | 2019-05-15 | 2 | -4/+9 | |
| | ||||||
* | python impl | Bryan Newbold | 2019-05-14 | 9 | -32/+38 | |
| | ||||||
* | python impl | Bryan Newbold | 2019-05-14 | 6 | -16/+16 | |
| | ||||||
* | python: impl size_bytes -> size | Bryan Newbold | 2019-05-13 | 1 | -1/+1 | |
| | ||||||
* | importer code updates | Bryan Newbold | 2019-05-13 | 4 | -3/+18 | |
| | ||||||
* | partial python impl of ext_id and release_stage refactors | Bryan Newbold | 2019-05-13 | 3 | -15/+20 | |
| | ||||||
* | add limits to match importers | Bryan Newbold | 2019-04-23 | 3 | -2/+27 | |
| | ||||||
* | archive.org isn't really a repository | Bryan Newbold | 2019-04-22 | 1 | -1/+3 | |
| | ||||||
* | editgroup description override | Bryan Newbold | 2019-04-22 | 1 | -2/+2 | |
| | ||||||
* | arabesque importer does require timestamp/wayback | Bryan Newbold | 2019-04-22 | 1 | -0/+3 | |
| | ||||||
* | matched importer shouldn't require wayback | Bryan Newbold | 2019-04-22 | 1 | -5/+7 | |
| | ||||||
* | handle API 400 in arabesque import (invalid extid) | Bryan Newbold | 2019-04-19 | 1 | -7/+14 | |
| | ||||||
* | fix arabesque importer crawl_id None bug | Bryan Newbold | 2019-04-18 | 1 | -1/+1 | |
| | ||||||
* | mechanism to not double-update entities | Bryan Newbold | 2019-04-18 | 2 | -1/+9 | |
| | ||||||
* | minor arabesque tweaks | Bryan Newbold | 2019-04-18 | 1 | -0/+2 | |
| | ||||||
* | update URL rel list | Bryan Newbold | 2019-04-18 | 1 | -1/+10 | |
| | ||||||
* | arabesque importer does fewer updates | Bryan Newbold | 2019-04-18 | 1 | -1/+8 | |
| |