Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | improve text scrubbing | Bryan Newbold | 2020-06-03 | 1 | -13/+21 | |
| | Was going to use textpipe, but dependency was too large and failed to install with halfway modern GCC (due to CLD2 issue): https://github.com/GregBowyer/cld2-cffi/issues/12 So instead basically pulled out the clean_text function, which is quite short. | |||||
* | partially resolve HTML form/hidden weirdness | Bryan Newbold | 2020-06-03 | 2 | -3/+13 | |
* | basic pagination | Bryan Newbold | 2020-06-03 | 2 | -0/+27 | |
| | Not well tested | |||||
* | tweak thumbnail vertical alignment | Bryan Newbold | 2020-06-03 | 1 | -1/+1 | |
* | compute and use tags | Bryan Newbold | 2020-06-03 | 2 | -2/+42 | |
* | add additional fatcat metadata tag | Bryan Newbold | 2020-06-03 | 1 | -5/+11 | |
* | start fleshing out /about and /help | Bryan Newbold | 2020-06-03 | 4 | -12/+152 | |
* | replace one of the PLOS thumbnails on homepage | Bryan Newbold | 2020-06-03 | 1 | -10/+11 | |
* | change availability filter phrasing; default to fulltext | Bryan Newbold | 2020-06-03 | 1 | -6/+6 | |
* | tweak search box placeholder text | Bryan Newbold | 2020-06-03 | 1 | -1/+1 | |
* | most links in new tab (target=_blank) | Bryan Newbold | 2020-06-03 | 4 | -30/+30 | |
* | more petabox timeout handling | Bryan Newbold | 2020-05-21 | 2 | -0/+6 | |
* | handle petabox read timeouts a bit | Bryan Newbold | 2020-05-21 | 2 | -2/+12 | |
* | add prefix scrubing (esp. for abstracts) | Bryan Newbold | 2020-05-21 | 1 | -0/+18 | |
* | use beautiful soup for XML scrubing | Bryan Newbold | 2020-05-21 | 1 | -7/+6 | |
* | make mypy happy | Bryan Newbold | 2020-05-21 | 1 | -1/+1 | |
* | implement crude availability filter | Bryan Newbold | 2020-05-21 | 1 | -0/+11 | |
* | fix typo in indexed document links | Bryan Newbold | 2020-05-21 | 1 | -1/+1 | |
* | be more inclusive of author names | Bryan Newbold | 2020-05-21 | 1 | -4/+4 | |
* | fix abstracts; experiment with search stemming | Bryan Newbold | 2020-05-21 | 1 | -2/+4 | |
* | first pass improving search scoring | Bryan Newbold | 2020-05-21 | 1 | -5/+35 | |
* | better translation marking; add some basic de and zh | Bryan Newbold | 2020-05-21 | 9 | -55/+371 | |
| | Current translations are just from Google Translate | |||||
* | UI mobile/tablet scaling; search error improvements | Bryan Newbold | 2020-05-21 | 5 | -12/+42 | |
* | mobile CSS/style changes, and other small UI tweaks | Bryan Newbold | 2020-05-21 | 5 | -42/+100 | |
* | fix typo with UnicodeDecodeError catch | Bryan Newbold | 2020-05-21 | 1 | -1/+1 | |
* | clean up domain/env detection code | Bryan Newbold | 2020-05-21 | 2 | -28/+11 | |
* | search query improvements | Bryan Newbold | 2020-05-21 | 5 | -145/+236 | |
| | wire up most of the filters and sort order; query sticks around in search box; crude error message (needs work) | |||||
* | less whitespace in jinja2 output | Bryan Newbold | 2020-05-21 | 1 | -0/+4 | |
* | skip pdftotext loading on unicode error | Bryan Newbold | 2020-05-20 | 1 | -0/+2 | |
* | skip SIM items w/o page_numbers (instead of asserting) | Bryan Newbold | 2020-05-20 | 2 | -2/+6 | |
* | fewer, longer highlights (2x of 250 chars) | Bryan Newbold | 2020-05-20 | 1 | -4/+4 | |
* | add a helper tag for search index document | Bryan Newbold | 2020-05-20 | 1 | -1/+5 | |
* | fix some ext_id links | Bryan Newbold | 2020-05-20 | 1 | -4/+4 | |
* | fixes from manual testing | Bryan Newbold | 2020-05-20 | 5 | -23/+31 | |
* | local pdftotext cache dir hack | Bryan Newbold | 2020-05-20 | 1 | -1/+18 | |
* | fixes to release+sim pipeline | Bryan Newbold | 2020-05-20 | 3 | -12/+39 | |
* | indexing tweaks | Bryan Newbold | 2020-05-20 | 2 | -16/+11 | |
* | update search template for schema | Bryan Newbold | 2020-05-20 | 1 | -129/+95 | |
* | first pass transform from pipelines to ES schema | Bryan Newbold | 2020-05-20 | 5 | -26/+540 | |
* | WIP on SIM pipeline | Bryan Newbold | 2020-05-19 | 2 | -2/+175 | |
* | WIP on release-to-sim fetching | Bryan Newbold | 2020-05-19 | 2 | -12/+124 | |
* | initial progress on work pipeline | Bryan Newbold | 2020-05-16 | 2 | -0/+336 | |
* | hack-y global serde ApiClient | Bryan Newbold | 2020-05-16 | 1 | -2/+4 | |
| | Motivation is to speed up serialization/deserialization for entity encoding. | |||||
* | crude djvu XML parsing | Bryan Newbold | 2020-05-16 | 1 | -0/+49 | |
* | basic biblio converter | Bryan Newbold | 2020-05-16 | 1 | -7/+120 | |
* | tweak ES schema fields a bit | Bryan Newbold | 2020-05-16 | 1 | -3/+4 | |
* | more progress on issue_db | Bryan Newbold | 2020-05-16 | 1 | -28/+47 | |
* | first pass at issue-db tool | Bryan Newbold | 2020-05-15 | 1 | -0/+286 | |
* | start implementing ES transform helpers | Bryan Newbold | 2020-05-14 | 2 | -0/+236 | |
* | style tweaks | Bryan Newbold | 2020-05-13 | 3 | -19/+34 | |