Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | allow importing contrib/refs lists | Bryan Newbold | 2019-01-24 | 4 | -13/+50 |
| | The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists. | ||||
* | codegen schema tweaks | Bryan Newbold | 2019-01-24 | 5 | -20/+61 |
| | |||||
* | Merge branch 'schema-tweaks' | Bryan Newbold | 2019-01-24 | 1 | -16/+8 |
|\ | |||||
| * | more IDENT types in API schema | Bryan Newbold | 2019-01-14 | 1 | -16/+8 |
| | | |||||
* | | more 2019-01-16 import timing | Bryan Newbold | 2019-01-24 | 1 | -0/+70 |
| | | |||||
* | | notes on refactoring container 'extra' | Bryan Newbold | 2019-01-24 | 1 | -0/+79 |
| | | |||||
* | | first-pass journal metadata munger | Bryan Newbold | 2019-01-24 | 5 | -0/+512 |
| | | |||||
* | | importer bugfixes | Bryan Newbold | 2019-01-23 | 3 | -8/+14 |
| | | |||||
* | | more import script fixes | Bryan Newbold | 2019-01-23 | 1 | -1/+4 |
| | | |||||
* | | initial changelog and container ES schemas | Bryan Newbold | 2019-01-23 | 2 | -0/+113 |
| | | |||||
* | | start changes to release ES schema | Bryan Newbold | 2019-01-23 | 5 | -141/+234 |
| | | |||||
* | | bunch of crossref import tweaks (need tests) | Bryan Newbold | 2019-01-23 | 1 | -50/+43 |
| | | |||||
* | | try to fix any_abstract | Bryan Newbold | 2019-01-23 | 1 | -1/+1 |
| | | |||||
* | | clean() checks if it returns null-length string | Bryan Newbold | 2019-01-23 | 1 | -1/+5 |
| | | |||||
* | | ensure no zero-length strings in SQL schema | Bryan Newbold | 2019-01-23 | 1 | -43/+43 |
| | | |||||
* | | update importer script | Bryan Newbold | 2019-01-23 | 1 | -33/+24 |
| | | |||||
* | | matched importer: bezerk mode to skip file updates | Bryan Newbold | 2019-01-23 | 1 | -11/+5 |
| | | |||||
* | | ensure crossref importer doesn't create empty editgroups | Bryan Newbold | 2019-01-23 | 1 | -0/+2 |
| | | |||||
* | | ftfy all over (needs Pipfile.lock) | Bryan Newbold | 2019-01-23 | 8 | -39/+75 |
| | | |||||
* | | add missing date | Bryan Newbold | 2019-01-23 | 1 | -1/+1 |
| | | |||||
* | | more tests; fix some importer behavior | Bryan Newbold | 2019-01-23 | 7 | -50/+111 |
| | | |||||
* | | specific test for desc/extra in editgroups | Bryan Newbold | 2019-01-23 | 1 | -2/+26 |
| | | |||||
* | | improve changelog tests | Bryan Newbold | 2019-01-23 | 6 | -12/+15 |
| | | |||||
* | | refactor remaining importers | Bryan Newbold | 2019-01-22 | 13 | -356/+324 |
| | | |||||
* | | allow passing description+extra to batch endpoints | Bryan Newbold | 2019-01-22 | 14 | -143/+638 |
| | | Pretty messy, but I needed some way to do this. In particular, requires json.dumps() in python code, for now. Blech. | ||||
* | | refactored crossref importer to new style | Bryan Newbold | 2019-01-22 | 5 | -118/+198 |
| | | |||||
* | | new importer API interfaces | Bryan Newbold | 2019-01-22 | 2 | -0/+181 |
| | | |||||
* | | crossref importer updates | Bryan Newbold | 2019-01-22 | 4 | -22/+82 |
| | | |||||
* | | add helper/hack script to generate bots | Bryan Newbold | 2019-01-22 | 1 | -0/+25 |
| | | |||||
* | | pubmed+datacite tokens; no journal,grobid,matched tokens | Bryan Newbold | 2019-01-22 | 2 | -5/+4 |
| | | |||||
* | | fix issn -> journal-metadata rename | Bryan Newbold | 2019-01-22 | 1 | -1/+1 |
| | | |||||
* | | MAG schema notes | Bryan Newbold | 2019-01-22 | 1 | -0/+65 |
| | | |||||
* | | 2019-01-16 QA import timing notes | Bryan Newbold | 2019-01-22 | 1 | -0/+422 |
| | | |||||
* | | more per-entity tests | Bryan Newbold | 2019-01-22 | 7 | -58/+312 |
| | | |||||
* | | add missing arxiv+jstor id indices | Bryan Newbold | 2019-01-22 | 1 | -0/+2 |
| | | |||||
* | | allow arxiv and jstor lookups | Bryan Newbold | 2019-01-21 | 12 | -13/+106 |
| | | |||||
* | | remove coden and abbrev from python tools | Bryan Newbold | 2019-01-21 | 2 | -8/+0 |
| | | |||||
* | | rust impl of new fields | Bryan Newbold | 2019-01-21 | 3 | -14/+38 |
| | | |||||
* | | codegen | Bryan Newbold | 2019-01-21 | 7 | -124/+297 |
| | | |||||
* | | SQL schema bump | Bryan Newbold | 2019-01-21 | 1 | -9/+12 |
| | | |||||
* | | yet more schema tweaks | Bryan Newbold | 2019-01-21 | 2 | -9/+41 |
| | | - remove abbrev and coden from container (never used; can put in extra) - add 'original_title' to release - add arxiv and JSTOR release IDs - add 'license_slug' to release - add 'raw_affiliation' string to release_contrib - add 'container_type' to container | ||||
* | | actually expand filesets/webcaptures | Bryan Newbold | 2019-01-18 | 1 | -1/+21 |
| | | |||||
* | | include filesets and webcaptures in exports | Bryan Newbold | 2019-01-18 | 2 | -2/+2 |
| | | |||||
* | | basic tests for filesets and webcaptures | Bryan Newbold | 2019-01-18 | 2 | -0/+160 |
| | | |||||
* | | fix typo in elastic transform code | Bryan Newbold | 2019-01-18 | 1 | -1/+1 |
| | | |||||
* | | python codegen | Bryan Newbold | 2019-01-18 | 2 | -3/+7 |
| | | |||||
* | | update import README with times | Bryan Newbold | 2019-01-18 | 1 | -2/+3 |
| | | |||||
* | | impl cdx timestamps as datetime | Bryan Newbold | 2019-01-18 | 3 | -5/+5 |
| | | |||||
* | | rust codegen | Bryan Newbold | 2019-01-18 | 4 | -13/+23 |
| | | |||||
* | | sql schema: cdx timestamps as datetime | Bryan Newbold | 2019-01-18 | 1 | -7/+7 |
| | |