| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | abstracts as object, not nested, until query parser | Bryan Newbold | 2020-05-21 | 1 | -5/+3 |
| | |||||
* | skip pdftotext loading on unicode error | Bryan Newbold | 2020-05-20 | 1 | -0/+2 |
| | |||||
* | skip SIM items w/o page_numbers (instead of asserting) | Bryan Newbold | 2020-05-20 | 2 | -2/+6 |
| | |||||
* | fewer, longer highlights (2x of 250 chars) | Bryan Newbold | 2020-05-20 | 1 | -4/+4 |
| | |||||
* | schema: releases as objects, not nested | Bryan Newbold | 2020-05-20 | 1 | -1/+1 |
| | With nested, we can't do simple aliases. In the future a proper query parser will make this possible. | | | | |
* | schema: many more aliases | Bryan Newbold | 2020-05-20 | 1 | -1/+19 |
| | |||||
* | add a helper tag for search index document | Bryan Newbold | 2020-05-20 | 1 | -1/+5 |
| | |||||
* | fix some ext_id links | Bryan Newbold | 2020-05-20 | 1 | -4/+4 |
| | |||||
* | fixes from manual testing | Bryan Newbold | 2020-05-20 | 6 | -25/+33 |
| | |||||
* | local pdftotext cache dir hack | Bryan Newbold | 2020-05-20 | 2 | -1/+19 |
| | |||||
* | fixes to release+sim pipeline | Bryan Newbold | 2020-05-20 | 3 | -12/+39 |
| | |||||
* | working docker-compose with elasticsearch (with plugins) | Bryan Newbold | 2020-05-20 | 2 | -0/+24 |
| | |||||
* | fixes to schema; actually working now | Bryan Newbold | 2020-05-20 | 1 | -3/+4 |
| | |||||
* | default search locations for different environments | Bryan Newbold | 2020-05-20 | 1 | -1/+4 |
| | |||||
* | local/dev indexing command | Bryan Newbold | 2020-05-20 | 1 | -8/+8 |
| | |||||
* | indexing tweaks | Bryan Newbold | 2020-05-20 | 2 | -16/+11 |
| | |||||
* | update search template for schema | Bryan Newbold | 2020-05-20 | 1 | -129/+95 |
| | |||||
* | first pass transform from pipelines to ES schema | Bryan Newbold | 2020-05-20 | 6 | -27/+541 |
| | |||||
* | WIP on SIM pipeline | Bryan Newbold | 2020-05-19 | 2 | -2/+175 |
| | |||||
* | WIP on release-to-sim fetching | Bryan Newbold | 2020-05-19 | 2 | -12/+124 |
| | |||||
* | pytest: squelch ABC warning (from internetarchive) | Bryan Newbold | 2020-05-16 | 1 | -0/+1 |
| | |||||
* | initial progress on work pipeline | Bryan Newbold | 2020-05-16 | 3 | -2/+338 |
| | |||||
* | hack-y global serde ApiClient | Bryan Newbold | 2020-05-16 | 1 | -2/+4 |
| | Motivation is to speed up serialization/deserialization for entity encoding. | | | | |
* | crude djvu XML parsing | Bryan Newbold | 2020-05-16 | 3 | -0/+5207 |
| | |||||
* | basic biblio converter | Bryan Newbold | 2020-05-16 | 2 | -8/+130 |
| | |||||
* | initial gitlab-ci file | Bryan Newbold | 2020-05-16 | 1 | -0/+18 |
| | |||||
* | tweak ES schema fields a bit | Bryan Newbold | 2020-05-16 | 3 | -19/+32 |
| | |||||
* | more progress on issue_db | Bryan Newbold | 2020-05-16 | 2 | -28/+60 |
| | |||||
* | gitkeep ./data/ directory | Bryan Newbold | 2020-05-16 | 2 | -0/+2 |
| | |||||
* | first pass at issue-db tool | Bryan Newbold | 2020-05-15 | 2 | -0/+321 |
| | |||||
* | pipenv: update fatcat-openapi-client; pydantic | Bryan Newbold | 2020-05-14 | 2 | -39/+39 |
| | |||||
* | ES index and makefile target | Bryan Newbold | 2020-05-14 | 2 | -1/+18 |
| | |||||
* | working pytest settings | Bryan Newbold | 2020-05-14 | 2 | -1/+14 |
| | |||||
* | start implementing ES transform helpers | Bryan Newbold | 2020-05-14 | 4 | -0/+256 |
| | |||||
* | first pass at scholar_fulltext schema | Bryan Newbold | 2020-05-14 | 2 | -141/+216 |
| | |||||
* | schema starting point (from covid19) | Bryan Newbold | 2020-05-13 | 1 | -0/+141 |
| | |||||
* | small Makefile best practice (?) tweak | Bryan Newbold | 2020-05-13 | 1 | -0/+3 |
| | |||||
* | style tweaks | Bryan Newbold | 2020-05-13 | 3 | -19/+34 |
| | |||||
* | skeleton of basic search, using covid19 index | Bryan Newbold | 2020-05-13 | 10 | -18/+758 |
| | |||||
* | translation notes in README | Bryan Newbold | 2020-05-13 | 1 | -0/+22 |
| | |||||
* | pipenv: dynaconf | Bryan Newbold | 2020-05-13 | 2 | -1/+24 |
| | |||||
* | Makefile; pipenv add gunicorn | Bryan Newbold | 2020-05-12 | 3 | -1/+29 |
| | |||||
* | very hack-y i18n support in jinja2 templates | Bryan Newbold | 2020-05-12 | 9 | -16/+220 |
| | |||||
* | fastapi infrastructure | Bryan Newbold | 2020-05-12 | 4 | -0/+88 |
| | API/web distinction; language code prefixes; content negotiation for endpoints | | | | |
* | pipenv: first pass | Bryan Newbold | 2020-05-12 | 2 | -0/+847 |
| | |||||
* | start sketching proposals | Bryan Newbold | 2020-05-11 | 5 | -0/+287 |
| | |||||
* | background reading links | Bryan Newbold | 2020-05-11 | 1 | -0/+49 |
| | |||||
* | init repo | Bryan Newbold | 2020-05-11 | 2 | -0/+25 |