aboutsummaryrefslogtreecommitdiffstats
path: root/python
Commit message (Collapse)AuthorAgeFilesLines
* filter bad and duplicate URLs from webfaceBryan Newbold2018-10-121-0/+7
| | | | Will need to do this properly in importers/backend.
* point status links to uptimerobot deep urlsBryan Newbold2018-10-121-2/+2
|
* update README_import with GROBID commandBryan Newbold2018-10-121-0/+2
|
* fix grobid_metadata_importer test for nowBryan Newbold2018-10-121-7/+12
|
* don't filter out crossref works based on container-titleBryan Newbold2018-10-121-3/+3
|
* fixes for grobid metadata importerBryan Newbold2018-09-282-1/+16
|
* show only date component of datetimeBryan Newbold2018-09-281-1/+1
|
* sprinkle in TODOs for crossref importerBryan Newbold2018-09-281-0/+2
|
* more special crossrefBryan Newbold2018-09-281-1/+1
|
* special case crossref json viewBryan Newbold2018-09-281-1/+1
|
* fix extra json length checkBryan Newbold2018-09-281-1/+1
|
* try to fix reference displayBryan Newbold2018-09-281-1/+16
|
* try to fix contrib indexingBryan Newbold2018-09-281-1/+1
|
* try to shorten extra metadataBryan Newbold2018-09-281-1/+1
|
* hack to fix search results for nowBryan Newbold2018-09-281-2/+6
|
* try to handle attribution order index betterBryan Newbold2018-09-281-1/+1
|
* search creator name unicode work-aroundBryan Newbold2018-09-272-3/+5
|
* not all releases have dateBryan Newbold2018-09-271-1/+4
|
* improvements to grobid_metadata importerBryan Newbold2018-09-273-79/+211
| | | | | But still fails tests due to database collision/side-effect on sha1 lookup.
* move grobid metadata importer from sandcrawlerBryan Newbold2018-09-271-0/+93
|
* many tweaks to release_view pageBryan Newbold2018-09-261-48/+63
|
* improvements to search results pageBryan Newbold2018-09-261-5/+20
|
* switch search to new elastic schema (and endpoint)Bryan Newbold2018-09-263-17/+13
|
* have domain URLs be configurableBryan Newbold2018-09-268-11/+13
|
* expand release API JSON linksBryan Newbold2018-09-261-1/+1
|
* more crossref_importer fixes (ugh)Bryan Newbold2018-09-251-2/+11
|
* if we need to create a container for a release, accept itBryan Newbold2018-09-241-1/+4
| | | | | | This is working around bad (and undocumented) behavior around autoaccept batch imports, which resulted in hundreds of thousands of duplicate "unaccepted" containers in production.
* parse 'issued' date format correctlyBryan Newbold2018-09-241-1/+8
|
* update importer docs for newer crossref dumpBryan Newbold2018-09-241-1/+1
|
* use correct publication date in crossref_importer (oops)Bryan Newbold2018-09-241-1/+1
|
* even more conservative size restrictionsBryan Newbold2018-09-241-1/+1
|
* cut down max row count even furtherBryan Newbold2018-09-241-1/+1
|
* filter out huge (> 2500 row) release fieldsBryan Newbold2018-09-231-0/+5
|
* add author sequence metadata to contribsBryan Newbold2018-09-231-2/+6
|
* commit old notes and other filesBryan Newbold2018-09-222-0/+2
|
* more python example filesBryan Newbold2018-09-222-0/+424
|
* changes to crossref importer (and tests)Bryan Newbold2018-09-222-33/+127
|
* update python codegenBryan Newbold2018-09-227-364/+32
| | | | | Strip out TAGLINE (tag-related) lines from spec, so we don't get many small Api classes (just one top-level). Ugh.
* update import READMEBryan Newbold2018-09-201-80/+24
|
* fix issues with extid mapping in crossref-importerBryan Newbold2018-09-202-2/+2
|
* actually link to guideBryan Newbold2018-09-201-4/+4
|
* TODO in crossref_importerBryan Newbold2018-09-201-0/+1
|
* add (untested) fix for fulltext linkBryan Newbold2018-09-201-2/+2
|
* fix CHECK clausesBryan Newbold2018-09-171-0/+5
|
* fix matched_importerBryan Newbold2018-09-141-2/+2
|
* more matched testsBryan Newbold2018-09-141-0/+10
|
* switch manifest importer to be json-basedBryan Newbold2018-09-145-146/+14
|
* small typoBryan Newbold2018-09-141-1/+1
|
* fixes to matched importer (and a test)Bryan Newbold2018-09-143-0/+174
|
* tune how crossref imports are filteredBryan Newbold2018-09-141-4/+4
|