Commit message | Author | Age | Files | Lines
* skip pdftotext loading on unicode error | Bryan Newbold | 2020-05-20 | 1 | -0/+2
* skip SIM items w/o page_numbers (instead of asserting) | Bryan Newbold | 2020-05-20 | 2 | -2/+6
* fewer, longer highlights (2x of 250 chars) | Bryan Newbold | 2020-05-20 | 1 | -4/+4
* schema: releases as objects, not nested | Bryan Newbold | 2020-05-20 | 1 | -1/+1
* schema: many more aliases | Bryan Newbold | 2020-05-20 | 1 | -1/+19
* add a helper tag for search index document | Bryan Newbold | 2020-05-20 | 1 | -1/+5
* fix some ext_id links | Bryan Newbold | 2020-05-20 | 1 | -4/+4
* fixes from manual testing | Bryan Newbold | 2020-05-20 | 6 | -25/+33
* local pdftotext cache dir hack | Bryan Newbold | 2020-05-20 | 2 | -1/+19
* fixes to release+sim pipeline | Bryan Newbold | 2020-05-20 | 3 | -12/+39
* working docker-compose with elasticsearch (with plugins) | Bryan Newbold | 2020-05-20 | 2 | -0/+24
* fixes to schema; actually working now | Bryan Newbold | 2020-05-20 | 1 | -3/+4
* default search locations for different environments | Bryan Newbold | 2020-05-20 | 1 | -1/+4
* local/dev indexing command | Bryan Newbold | 2020-05-20 | 1 | -8/+8
* indexing tweaks | Bryan Newbold | 2020-05-20 | 2 | -16/+11
* update search template for schema | Bryan Newbold | 2020-05-20 | 1 | -129/+95
* first pass transform from pipelines to ES schema | Bryan Newbold | 2020-05-20 | 6 | -27/+541
* WIP on SIM pipeline | Bryan Newbold | 2020-05-19 | 2 | -2/+175
* WIP on release-to-sim fetching | Bryan Newbold | 2020-05-19 | 2 | -12/+124
* pytest: squelch ABC warning (from internetarchive) | Bryan Newbold | 2020-05-16 | 1 | -0/+1
* initial progress on work pipeline | Bryan Newbold | 2020-05-16 | 3 | -2/+338
* hack-y global serde ApiClient | Bryan Newbold | 2020-05-16 | 1 | -2/+4
* crude djvu XML parsing | Bryan Newbold | 2020-05-16 | 3 | -0/+5207
* basic biblio converter | Bryan Newbold | 2020-05-16 | 2 | -8/+130
* initial gitlab-ci file | Bryan Newbold | 2020-05-16 | 1 | -0/+18
* tweak ES schema fields a bit | Bryan Newbold | 2020-05-16 | 3 | -19/+32
* more progress on issue_db | Bryan Newbold | 2020-05-16 | 2 | -28/+60
* gitkeep ./data/ directory | Bryan Newbold | 2020-05-16 | 2 | -0/+2
* first pass at issue-db tool | Bryan Newbold | 2020-05-15 | 2 | -0/+321
* pipenv: update fatcat-openapi-client; pydantic | Bryan Newbold | 2020-05-14 | 2 | -39/+39
* ES index and makefile target | Bryan Newbold | 2020-05-14 | 2 | -1/+18
* working pytest settings | Bryan Newbold | 2020-05-14 | 2 | -1/+14
* start implementing ES transform helpers | Bryan Newbold | 2020-05-14 | 4 | -0/+256
* first pass at scholar_fulltext schema | Bryan Newbold | 2020-05-14 | 2 | -141/+216
* schema starting point (from covid19) | Bryan Newbold | 2020-05-13 | 1 | -0/+141
* small Makefile best practice (?) tweak | Bryan Newbold | 2020-05-13 | 1 | -0/+3
* style tweaks | Bryan Newbold | 2020-05-13 | 3 | -19/+34
* skeleton of basic search, using covid19 index | Bryan Newbold | 2020-05-13 | 10 | -18/+758
* translation notes in README | Bryan Newbold | 2020-05-13 | 1 | -0/+22
* pipenv: dynaconf | Bryan Newbold | 2020-05-13 | 2 | -1/+24
* Makefile; pipenv add gunicorn | Bryan Newbold | 2020-05-12 | 3 | -1/+29
* very hack-y i18n support in jinja2 templates | Bryan Newbold | 2020-05-12 | 9 | -16/+220
* fastapi infrastructure | Bryan Newbold | 2020-05-12 | 4 | -0/+88
* pipenv: first pass | Bryan Newbold | 2020-05-12 | 2 | -0/+847
* start sketching proposals | Bryan Newbold | 2020-05-11 | 5 | -0/+287
* background reading links | Bryan Newbold | 2020-05-11 | 1 | -0/+49
* init repo | Bryan Newbold | 2020-05-11 | 2 | -0/+25