Commit message  [Author, Age, Files, Lines]
* fix rel/url order swap  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* remove accidental print in release transform  [Bryan Newbold, 2019-01-28, 1, -1/+0]
|
* more ES index name updates  [Bryan Newbold, 2019-01-28, 1, -2/+3]
|
* Fix Cargo version pins  [Bryan Newbold, 2019-01-28, 2, -4/+4]
|
* don't allow empty or single-character clean strings  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* update export README  [Bryan Newbold, 2019-01-28, 1, -6/+7]
|
* fix tests/cli.sh  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* update rust Cargo lock (from version bump)  [Bryan Newbold, 2019-01-28, 1, -5/+5]
|
* filter short/stub original_title  [Bryan Newbold, 2019-01-28, 1, -3/+7]
|
* fix typo in container transform  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* fixes to transform code  [Bryan Newbold, 2019-01-28, 1, -9/+11]
|
* fix rust comment typos  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* bump versions (0.1.0 to 0.2.0)  [Bryan Newbold, 2019-01-28, 64, -68/+68]
|
* add quick test for WARN rust logging  [Bryan Newbold, 2019-01-28, 1, -0/+12]
|
* rustfmt  [Bryan Newbold, 2019-01-28, 1, -3/+6]
|
* get regular log lines flowing through slog  [Bryan Newbold, 2019-01-28, 3, -1/+52]
|     The plan here is to replace slog with env_logger (much simpler!), but not there yet.
* wrote a whole thing to log Warning headers and not needed  [Bryan Newbold, 2019-01-28, 1, -0/+3]
|
* many fixes in GROBID importer  [Bryan Newbold, 2019-01-28, 1, -14/+10]
|
* fix matched test vector  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|     this was resulting in a collision with default/example database objects.
* fix GROBID null/short abstract additions  [Bryan Newbold, 2019-01-28, 1, -1/+2]
|
* batch size as a general import param  [Bryan Newbold, 2019-01-28, 1, -13/+4]
|
* add missing bezerk-mode flag to GROBID import  [Bryan Newbold, 2019-01-28, 1, -3/+8]
|
* squelch some rust unused imports  [Bryan Newbold, 2019-01-28, 2, -2/+0]
|
* enforce title len>1 for release imports  [Bryan Newbold, 2019-01-28, 2, -1/+8]
|
* fix typo in crossref importer  [Bryan Newbold, 2019-01-28, 1, -1/+1]
|
* drop creators with no display name at all  [Bryan Newbold, 2019-01-28, 1, -3/+3]
|
* make ORCID importer skip no-names, not assert  [Bryan Newbold, 2019-01-28, 1, -1/+2]
|
* more ES index fixes  [Bryan Newbold, 2019-01-28, 3, -3/+4]
|
* vastly improve entity_to_dict() speed  [Bryan Newbold, 2019-01-28, 1, -1/+9]
|
* add filesets and webcaptures to dumps  [Bryan Newbold, 2019-01-28, 5, -2/+35]
|
* fatcat -> fatcat_release ES index  [Bryan Newbold, 2019-01-28, 3, -20/+21]
|
* transform and import fixes/tweaks  [Bryan Newbold, 2019-01-25, 8, -30/+214]
|
* improved journal metadata munger  [Bryan Newbold, 2019-01-25, 2, -100/+325]
|
* tweak elastic schemas (again)  [Bryan Newbold, 2019-01-25, 2, -6/+4]
|
* update journal meta import/transform  [Bryan Newbold, 2019-01-25, 6, -154/+226]
|
* grobid import extra metadata tweaks  [Bryan Newbold, 2019-01-24, 1, -6/+7]
|
* refactor _get_editgroup => get_editgroup_id  [Bryan Newbold, 2019-01-24, 2, -5/+6]
|
* refactor make_rel_url  [Bryan Newbold, 2019-01-24, 3, -29/+66]
|
* examples of works with many authors (eg, used in tests)  [Bryan Newbold, 2019-01-24, 1, -0/+6]
|
* tweak crossref import, and update tests  [Bryan Newbold, 2019-01-24, 5, -32/+89]
|
* empty fields test  [Bryan Newbold, 2019-01-24, 1, -0/+13]
|
* allow importing contrib/refs lists  [Bryan Newbold, 2019-01-24, 4, -13/+50]
|     The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists.
* codegen schema tweaks  [Bryan Newbold, 2019-01-24, 5, -20/+61]
|
* Merge branch 'schema-tweaks'  [Bryan Newbold, 2019-01-24, 1, -16/+8]
|\
| * more IDENT types in API schema  [Bryan Newbold, 2019-01-14, 1, -16/+8]
| |
* | more 2019-01-16 import timing  [Bryan Newbold, 2019-01-24, 1, -0/+70]
| |
* | notes on refactoring container 'extra'  [Bryan Newbold, 2019-01-24, 1, -0/+79]
| |
* | first-pass journal metadata munger  [Bryan Newbold, 2019-01-24, 5, -0/+512]
| |
* | importer bugfixes  [Bryan Newbold, 2019-01-23, 3, -8/+14]
| |
* | more import script fixes  [Bryan Newbold, 2019-01-23, 1, -1/+4]
| |