Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
* | use container_name, not container_ident, in boost | Bryan Newbold | 2020-08-12 | 1 | -1/+1 | |
| | | | | | This should result in SIM page fulltext matches not getting pushed down as much, as well as things like biorxiv (*rxiv) results. | |||||
* | use simple names, not domain names, for some platforms | Bryan Newbold | 2020-08-12 | 1 | -3/+3 | |
| | ||||||
* | more scaling notes | Bryan Newbold | 2020-08-12 | 1 | -0/+97 | |
| | ||||||
* | fmt/lint tweaks | Bryan Newbold | 2020-08-12 | 2 | -6/+2 | |
| | ||||||
* | another clean_str() test case | Bryan Newbold | 2020-08-12 | 1 | -0/+4 | |
| | ||||||
* | biblio metadata hacks at transform time | Bryan Newbold | 2020-08-12 | 1 | -2/+98 | |
| | ||||||
* | transform: more string cleaning | Bryan Newbold | 2020-08-12 | 2 | -13/+78 | |
| | ||||||
* | search: include 'article' in papers filter | Bryan Newbold | 2020-08-12 | 1 | -1/+1 | |
| | ||||||
* | search: use simplified query for highlighting | Bryan Newbold | 2020-08-12 | 1 | -1/+8 | |
| | | | | | | | | This fixes broken phrase query highlighting. I found this issue, but it may be unrelated: https://github.com/elastic/elasticsearch/issues/40227 | |||||
* | docker: bump ES to 7.8 for local dev | Bryan Newbold | 2020-08-12 | 1 | -1/+1 | |
| | ||||||
* | makefile: add dev-prod command for local dev against prod search index | Bryan Newbold | 2020-08-12 | 1 | -0/+4 | |
| | ||||||
* | don't print config; make fmt | Bryan Newbold | 2020-08-06 | 1 | -3/+7 | |
| | ||||||
* | re-use ES sync API client | Bryan Newbold | 2020-08-06 | 1 | -3/+4 | |
| | ||||||
* | 'more versions' dropdown table | Bryan Newbold | 2020-08-06 | 1 | -0/+82 | |
| | ||||||
* | small HTML simplifications | Bryan Newbold | 2020-08-06 | 1 | -6/+6 | |
| | ||||||
* | report ES API query time as server-timing header | Bryan Newbold | 2020-08-06 | 2 | -1/+13 | |
| | ||||||
* | squish collapse button in with tags | Bryan Newbold | 2020-08-06 | 1 | -8/+7 | |
| | ||||||
* | have search buttons animate after submit | Bryan Newbold | 2020-08-06 | 2 | -3/+10 | |
| | | | | Extremely minimal javascript used | |||||
* | add debug mode flag (to control json tag/link) | Bryan Newbold | 2020-08-06 | 3 | -5/+11 | |
| | ||||||
* | slightly more padding in SERP box at max screen size | Bryan Newbold | 2020-08-06 | 2 | -1/+4 | |
| | ||||||
* | remove javascript includes | Bryan Newbold | 2020-08-06 | 1 | -0/+4 | |
| | ||||||
* | basic placeholder thumbnail image | Bryan Newbold | 2020-08-06 | 3 | -3/+191 | |
| | ||||||
* | sort tags, and show JSTOR as a color tag | Bryan Newbold | 2020-08-06 | 1 | -1/+3 | |
| | ||||||
* | show language code as a tag | Bryan Newbold | 2020-08-06 | 2 | -2/+7 | |
| | ||||||
* | set HTML language to locale correctly | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | ||||||
* | don't index sim_page without issue_item and first_page | Bryan Newbold | 2020-08-06 | 1 | -0/+3 | |
| | ||||||
* | volume_int/issue_int as actual ints | Bryan Newbold | 2020-08-06 | 1 | -2/+2 | |
| | ||||||
* | make fmt | Bryan Newbold | 2020-08-06 | 1 | -14/+14 | |
| | ||||||
* | ES schema: access_type should be any option, not just 'best' | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | ||||||
* | handle integer conversion and bounding for ES schema | Bryan Newbold | 2020-08-06 | 2 | -19/+35 | |
| | ||||||
* | microfilm access filter; broader access matching | Bryan Newbold | 2020-08-06 | 1 | -3/+6 | |
| | ||||||
* | handle longer query times | Bryan Newbold | 2020-08-06 | 1 | -2/+10 | |
| | ||||||
* | scrub_text: single-token strings skipped | Bryan Newbold | 2020-08-06 | 2 | -1/+5 | |
| | ||||||
* | strip ACKNOWLEDGEMENTS prefix | Bryan Newbold | 2020-08-06 | 1 | -0/+1 | |
| | ||||||
* | fix acknowledgement highlighting (typo) | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | ||||||
* | more notes on scaling | Bryan Newbold | 2020-08-06 | 1 | -0/+363 | |
| | ||||||
* | reduce title boost; use only base query for highlighting | Bryan Newbold | 2020-08-06 | 1 | -1/+2 | |
| | ||||||
* | special case '*' queries | Bryan Newbold | 2020-08-06 | 1 | -6/+16 | |
| | | | | | More/better query parsing in the client could detect if this was a "filter only" query and do the same kind of optimization. | |||||
* | remove 'title' from poor metadata scoring | Bryan Newbold | 2020-08-06 | 1 | -1/+0 | |
| | ||||||
* | better time ranges (don't search future) | Bryan Newbold | 2020-08-06 | 1 | -4/+7 | |
| | ||||||
* | add title back to match query | Bryan Newbold | 2020-08-06 | 1 | -0/+1 | |
| | ||||||
* | enable index_phrases on everything, biblio_all, title_all | Bryan Newbold | 2020-08-06 | 1 | -5/+3 | |
| | | | | | Want phrase queries to be faster. Expect this to increase term index size, requiring more disk space. | |||||
* | ES schema: do not index fulltext.body or fulltext.annex separately from 'everything' | Bryan Newbold | 2020-08-06 | 1 | -3/+2 | |
| | | | | | | | | The goal here is to reduce term index size. This means that querying/matching only on these fields (distinct from "everything") will not work. | |||||
* | ES schema: use smaller integer size (short) for most numbers | Bryan Newbold | 2020-08-06 | 1 | -5/+5 | |
| | ||||||
* | ES schema: copy_to titles into single title_all field | Bryan Newbold | 2020-08-06 | 1 | -4/+4 | |
| | ||||||
* | query fewer fields; highlight all fulltext fields regardless of match | Bryan Newbold | 2020-08-06 | 1 | -3/+1 | |
| | ||||||
* | fix typo in SERP page macro | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | ||||||
* | search tweaks to be forwards-compatible with ES 7.x | Bryan Newbold | 2020-08-06 | 1 | -2/+10 | |
| | | | | | | When we fully commit to ES 7.x we should upgrade the client library correspondingly, and then can remove these work-arounds. But for now we have one instance of ES 6.x and one ES 7.x. | |||||
* | extend ES client timeout to 25 seconds | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | ||||||
* | fix display of papers missing fulltext | Bryan Newbold | 2020-08-06 | 1 | -1/+1 | |
| | | | | | | I think the bug happened now that we do not serialize the pydantic structures with empty values. A better solution might be to deserialize search hits into pydantic objects before rendering. |
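
The last commit body above suggests deserializing search hits into typed objects before rendering, so that fields dropped during serialization come back as explicit empty values. The following is only a rough sketch of that idea, not the project's code: it uses a standard-library dataclass as a stand-in for a pydantic model, and the names `SearchHit`, `hit_from_source`, and the field names are hypothetical.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict, Optional


@dataclass
class SearchHit:
    # Hypothetical fields, for illustration only. Optional fields default
    # to None, so a hit whose empty values were never serialized still
    # exposes every attribute a template might read.
    key: str
    title: Optional[str] = None
    fulltext_body: Optional[str] = None


def hit_from_source(source: Dict[str, Any]) -> SearchHit:
    """Build a SearchHit from a raw hit dict, ignoring unknown keys."""
    known = {f.name for f in fields(SearchHit)}
    return SearchHit(**{k: v for k, v in source.items() if k in known})


# A hit with no fulltext: the key is simply absent from the raw dict.
hit = hit_from_source({"key": "work_abc", "title": "Example", "extra": 1})
```

With this shape, rendering code can test `hit.fulltext_body is None` instead of guarding against a missing dictionary key. A real pydantic model would add validation and type coercion on top of the same pattern.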