Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | vastly improve entity_to_dict() speed | Bryan Newbold | 2019-01-28 | 1 | -1/+9 |
| | |||||
* | add filesets and webcaptures to dumps | Bryan Newbold | 2019-01-28 | 5 | -2/+35 |
| | |||||
* | fatcat -> fatcat_release ES index | Bryan Newbold | 2019-01-28 | 3 | -20/+21 |
| | |||||
* | transform and import fixes/tweaks | Bryan Newbold | 2019-01-25 | 8 | -30/+214 |
| | |||||
* | improved journal metadata munger | Bryan Newbold | 2019-01-25 | 2 | -100/+325 |
| | |||||
* | tweak elastic schemas (again) | Bryan Newbold | 2019-01-25 | 2 | -6/+4 |
| | |||||
* | update journal meta import/transform | Bryan Newbold | 2019-01-25 | 6 | -154/+226 |
| | |||||
* | grobid import extra metadata tweaks | Bryan Newbold | 2019-01-24 | 1 | -6/+7 |
| | |||||
* | refactor _get_editgroup => get_editgroup_id | Bryan Newbold | 2019-01-24 | 2 | -5/+6 |
| | |||||
* | refactor make_rel_url | Bryan Newbold | 2019-01-24 | 3 | -29/+66 |
| | |||||
* | examples of works with many authors (eg, used in tests) | Bryan Newbold | 2019-01-24 | 1 | -0/+6 |
| | |||||
* | tweak crossref import, and update tests | Bryan Newbold | 2019-01-24 | 5 | -32/+89 |
| | |||||
* | empty fields test | Bryan Newbold | 2019-01-24 | 1 | -0/+13 |
| | |||||
* | allow importing contrib/refs lists | Bryan Newbold | 2019-01-24 | 4 | -13/+50 |
| | | | | | | The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists. | ||||
* | codegen schema tweaks | Bryan Newbold | 2019-01-24 | 5 | -20/+61 |
| | |||||
* | Merge branch 'schema-tweaks' | Bryan Newbold | 2019-01-24 | 1 | -16/+8 |
|\ | |||||
| * | more IDENT types in API schema | Bryan Newbold | 2019-01-14 | 1 | -16/+8 |
| | | |||||
* | | more 2019-01-16 import timing | Bryan Newbold | 2019-01-24 | 1 | -0/+70 |
| | | |||||
* | | notes on refactoring container 'extra' | Bryan Newbold | 2019-01-24 | 1 | -0/+79 |
| | | |||||
* | | first-pass journal metadata munger | Bryan Newbold | 2019-01-24 | 5 | -0/+512 |
| | | |||||
* | | importer bugfixes | Bryan Newbold | 2019-01-23 | 3 | -8/+14 |
| | | |||||
* | | more import script fixes | Bryan Newbold | 2019-01-23 | 1 | -1/+4 |
| | | |||||
* | | initial changelog and container ES schemas | Bryan Newbold | 2019-01-23 | 2 | -0/+113 |
| | | |||||
* | | start changes to release ES schema | Bryan Newbold | 2019-01-23 | 5 | -141/+234 |
| | | |||||
* | | bunch of crossref import tweaks (need tests) | Bryan Newbold | 2019-01-23 | 1 | -50/+43 |
| | | |||||
* | | try to fix any_abstract | Bryan Newbold | 2019-01-23 | 1 | -1/+1 |
| | | |||||
* | | clean() checks if it returns null-length string | Bryan Newbold | 2019-01-23 | 1 | -1/+5 |
| | | |||||
* | | ensure no zero-length strings in SQL schema | Bryan Newbold | 2019-01-23 | 1 | -43/+43 |
| | | |||||
* | | update importer script | Bryan Newbold | 2019-01-23 | 1 | -33/+24 |
| | | |||||
* | | matched importer: bezerk mode to skip file updates | Bryan Newbold | 2019-01-23 | 1 | -11/+5 |
| | | |||||
* | | ensure crossref importer doesn't create empty editgroups | Bryan Newbold | 2019-01-23 | 1 | -0/+2 |
| | | |||||
* | | ftfy all over (needs Pipfile.lock) | Bryan Newbold | 2019-01-23 | 8 | -39/+75 |
| | | |||||
* | | add missing date | Bryan Newbold | 2019-01-23 | 1 | -1/+1 |
| | | |||||
* | | more tests; fix some importer behavior | Bryan Newbold | 2019-01-23 | 7 | -50/+111 |
| | | |||||
* | | specific test for desc/extra in editgroups | Bryan Newbold | 2019-01-23 | 1 | -2/+26 |
| | | |||||
* | | improve changelog tests | Bryan Newbold | 2019-01-23 | 6 | -12/+15 |
| | | |||||
* | | refactor remaining importers | Bryan Newbold | 2019-01-22 | 13 | -356/+324 |
| | | |||||
* | | allow passing description+extra to batch endpoints | Bryan Newbold | 2019-01-22 | 14 | -143/+638 |
| | | | | | | | | | | | | Pretty messy, but I needed some way to do this. In particular, requires json.dumps() in python code, for now. Blech. | ||||
* | | refactored crossref importer to new style | Bryan Newbold | 2019-01-22 | 5 | -118/+198 |
| | | |||||
* | | new importer API interfaces | Bryan Newbold | 2019-01-22 | 2 | -0/+181 |
| | | |||||
* | | crossref importer updates | Bryan Newbold | 2019-01-22 | 4 | -22/+82 |
| | | |||||
* | | add helper/hack script to generate bots | Bryan Newbold | 2019-01-22 | 1 | -0/+25 |
| | | |||||
* | | pubmed+datacite tokens; no journal,grobid,matched tokens | Bryan Newbold | 2019-01-22 | 2 | -5/+4 |
| | | |||||
* | | fix issn -> journal-metadata rename | Bryan Newbold | 2019-01-22 | 1 | -1/+1 |
| | | |||||
* | | MAG schema notes | Bryan Newbold | 2019-01-22 | 1 | -0/+65 |
| | | |||||
* | | 2019-01-16 QA import timing notes | Bryan Newbold | 2019-01-22 | 1 | -0/+422 |
| | | |||||
* | | more per-entity tests | Bryan Newbold | 2019-01-22 | 7 | -58/+312 |
| | | |||||
* | | add missing arxiv+jstor id indices | Bryan Newbold | 2019-01-22 | 1 | -0/+2 |
| | | |||||
* | | allow arxiv and jstor lookups | Bryan Newbold | 2019-01-21 | 12 | -13/+106 |
| | | |||||
* | | remove coden and abbrev from python tools | Bryan Newbold | 2019-01-21 | 2 | -8/+0 |
| | |