summaryrefslogtreecommitdiffstats
path: root/fatcat_scholar
Commit message (Collapse)AuthorAgeFilesLines
* add prefix scrubing (esp. for abstracts)Bryan Newbold2020-05-211-0/+18
|
* use beautiful soup for XML scrubingBryan Newbold2020-05-211-7/+6
|
* make mypy happyBryan Newbold2020-05-211-1/+1
|
* implement crude availability filterBryan Newbold2020-05-211-0/+11
|
* fix typo in indexed document linksBryan Newbold2020-05-211-1/+1
|
* be more inclusive of author namesBryan Newbold2020-05-211-4/+4
|
* fix abstracts; experiment with search stemmingBryan Newbold2020-05-211-2/+4
|
* first pass improving search scoringBryan Newbold2020-05-211-5/+35
|
* better translation marking; add some basic de and zhBryan Newbold2020-05-219-55/+371
| | | | Current translations are just from Google Translate
* UI mobile/tablet scaling; search error improvementsBryan Newbold2020-05-215-12/+42
|
* mobile CSS/style changes, and other small UI tweaksBryan Newbold2020-05-215-42/+100
|
* fix typo with UnicodeDecodeError catchBryan Newbold2020-05-211-1/+1
|
* clean up domain/env detection codeBryan Newbold2020-05-212-28/+11
|
* search query improvementsBryan Newbold2020-05-215-145/+236
| | | | | | - wire up most of the filters and sort order - query sticks around in search box - crude error message (needs work)
* less whitespace in jinja2 outputBryan Newbold2020-05-211-0/+4
|
* skip pdftotext loading on unicode errorBryan Newbold2020-05-201-0/+2
|
* skip SIM items w/o page_numbers (instead of asserting)Bryan Newbold2020-05-202-2/+6
|
* fewer, longer highlights (2x of 250 chars)Bryan Newbold2020-05-201-4/+4
|
* add a helper tag for search index documentBryan Newbold2020-05-201-1/+5
|
* fix some ext_id linksBryan Newbold2020-05-201-4/+4
|
* fixes from manual testingBryan Newbold2020-05-205-23/+31
|
* local pdftotext cache dir hackBryan Newbold2020-05-201-1/+18
|
* fixes to release+sim pipelineBryan Newbold2020-05-203-12/+39
|
* indexing tweaksBryan Newbold2020-05-202-16/+11
|
* update search template for schemaBryan Newbold2020-05-201-129/+95
|
* first pass transform from pipelines to ES schemaBryan Newbold2020-05-205-26/+540
|
* WIP on SIM pipelineBryan Newbold2020-05-192-2/+175
|
* WIP on release-to-sim fetchingBryan Newbold2020-05-192-12/+124
|
* initial progress on work pipelineBryan Newbold2020-05-162-0/+336
|
* hack-y global serde ApiClientBryan Newbold2020-05-161-2/+4
| | | | | Motivation is to speed up serialization/deserialization for entity encoding.
* crude djvu XML parsingBryan Newbold2020-05-161-0/+49
|
* basic biblio converterBryan Newbold2020-05-161-7/+120
|
* tweak ES schema fields a bitBryan Newbold2020-05-161-3/+4
|
* more progress on issue_dbBryan Newbold2020-05-161-28/+47
|
* first pass at issue-db toolBryan Newbold2020-05-151-0/+286
|
* start implementing ES transform helpersBryan Newbold2020-05-142-0/+236
|
* style tweaksBryan Newbold2020-05-133-19/+34
|
* skeleton of basic search, using covid19 indexBryan Newbold2020-05-139-18/+751
|
* very hack-y i18n support in jinja2 templatesBryan Newbold2020-05-127-16/+189
|
* fastapi infrastructureBryan Newbold2020-05-124-0/+88
- API/web distrinction - language code prefixes - content negotiation for endpoints