aboutsummaryrefslogtreecommitdiffstats
path: root/tests
Commit message (Collapse)AuthorAgeFilesLines
* make fmtBryan Newbold2021-07-261-5/+13
|
* fix failing test after clean_doi()Bryan Newbold2021-07-261-1/+1
|
* refs transform: many fixesBryan Newbold2021-07-252-1/+274
| | | | | | | | | - include year correctly (many cases) - test coverage for Crossref transform - pass-through 'edition' as 'version' - series-title parsed in to title or container as appropriate - missing release stage - fix 0-index vs. 1-index ref index field
* refs transform: 1-index refs.index, not 0-indexBryan Newbold2021-07-251-1/+1
| | | | | | | | This was not matching expectations/schema of downstream refs pipeline (cgraph), and wasn't matching documented schema. Note care required when checking if the index is set, to distinguish between '0' and 'None' values.
* refs: include (source) release_stage in outputBryan Newbold2021-06-301-9/+18
|
* commit missing elastic get example JSON filesBryan Newbold2021-06-112-0/+174
|
* update citation_pdf_url HTML meta tag to new access URL styleBryan Newbold2021-06-111-0/+1
|
* update access redirect URL endpointsBryan Newbold2021-06-111-19/+20
|
* lint fixes, and run fmtBryan Newbold2021-06-021-4/+1
|
* add 'crossref' hydration to work pipelineBryan Newbold2021-06-021-0/+16
| | | | | | | | The immediate motivation is to include recent crossref refs in citation graph transforms. May also be valuable for researchers to have authoritative/publisher metadata in the bundle dumps.
* web: fixes to access redirect endpointsBryan Newbold2021-05-191-0/+11
|
* iterate on PDF redirect linksBryan Newbold2021-05-171-3/+41
|
* iterate on access redirects and landing page implementationBryan Newbold2021-04-272-0/+123
| | | | Small code refactors and minimal test coverage
* Revert undesirable changesChristian Clauss2021-02-236-11/+11
|
* Modernize Python syntax with pyupgrade --py38-plus **/*.pyChristian Clauss2021-02-236-11/+11
|
* api: handle null 'q' parameter on search endpointBryan Newbold2021-02-111-1/+5
|
* refactor ES configuration setting namesBryan Newbold2021-01-251-1/+1
|
* api: fix /search test, and mypy error on implementationBryan Newbold2021-01-151-1/+11
|
* add mocks to work pipeline testBryan Newbold2021-01-141-1/+63
|
* add regression test for uvloop+httptools uvicorn problemBryan Newbold2021-01-051-0/+11
|
* improve Accept-Language header parsingBryan Newbold2020-12-021-0/+4
|
* fmtBryan Newbold2020-10-281-1/+0
|
* fixes to issue_db testsBryan Newbold2020-10-231-6/+3
|
* basic web search testBryan Newbold2020-10-232-1/+1701
|
* basic test for issue-db pipelineBryan Newbold2020-10-233-0/+30
|
* start test coverage for web interfaceBryan Newbold2020-10-222-0/+68
|
* improve test coverageBryan Newbold2020-10-225-0/+72
|
* minimum viable tests for GROBID XML parsing and refs transformBryan Newbold2020-09-143-0/+535
|
* another clean_str() test caseBryan Newbold2020-08-121-0/+4
|
* transform: more string cleaningBryan Newbold2020-08-121-1/+19
|
* scrub_text: single-token strings skippedBryan Newbold2020-08-061-1/+1
|
* start some annotaition fixes for pytypeBryan Newbold2020-06-031-1/+1
|
* flake8-annotation lintingBryan Newbold2020-06-033-4/+4
| | | | Added some new annotations; need to finish more.
* flake8 fixes (partial)Bryan Newbold2020-06-032-3/+0
|
* reformat python code with blackBryan Newbold2020-06-033-13/+19
|
* improve text scrubbingBryan Newbold2020-06-031-0/+15
| | | | | | | | | | Was going to use textpipe, but dependency was too large and failed to install with halfway modern GCC (due to CLD2 issue): https://github.com/GregBowyer/cld2-cffi/issues/12 So instead basically pulled out the clean_text function, which is quite short.
* first pass transform from pipelines to ES schemaBryan Newbold2020-05-201-1/+1
|
* initial progress on work pipelineBryan Newbold2020-05-161-2/+2
|
* crude djvu XML parsingBryan Newbold2020-05-162-0/+5158
|
* basic biblio converterBryan Newbold2020-05-161-1/+10
|
* start implementing ES transform helpersBryan Newbold2020-05-142-0/+20