index
:
fatcat-scholar
bnewbold-jammy
debug-no-i18n
master
x-attic-gitlab-a11y
x-attic-rescore
Unnamed repository; edit this file 'description' to name the repository.
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
commit prototype pipeline notes (in README)
Bryan Newbold
2020-06-03
1
-0
/
+47
*
more petabox timeout handling
Bryan Newbold
2020-05-21
2
-0
/
+6
*
handle petabox read timeouts a bit
Bryan Newbold
2020-05-21
2
-2
/
+12
*
HTML strip in ES indexing
Bryan Newbold
2020-05-21
1
-4
/
+4
*
add prefix scrubing (esp. for abstracts)
Bryan Newbold
2020-05-21
1
-0
/
+18
*
use beautiful soup for XML scrubing
Bryan Newbold
2020-05-21
1
-7
/
+6
*
make mypy happy
Bryan Newbold
2020-05-21
1
-1
/
+1
*
helpers to fetch small-ish data samples
Bryan Newbold
2020-05-21
1
-0
/
+8
*
implement crude availability filter
Bryan Newbold
2020-05-21
1
-0
/
+11
*
fix typo in indexed document links
Bryan Newbold
2020-05-21
1
-1
/
+1
*
be more inclusive of author names
Bryan Newbold
2020-05-21
1
-4
/
+4
*
fix abstracts; experiment with search stemming
Bryan Newbold
2020-05-21
3
-8
/
+36
*
first pass improving search scoring
Bryan Newbold
2020-05-21
2
-5
/
+36
*
better translation marking; add some basic de and zh
Bryan Newbold
2020-05-21
12
-61
/
+534
*
UI mobile/tablet scaling; search error improvements
Bryan Newbold
2020-05-21
5
-12
/
+42
*
mobile CSS/style changes, and other small UI tweaks
Bryan Newbold
2020-05-21
5
-42
/
+100
*
fix typo with UnicodeDecodeError catch
Bryan Newbold
2020-05-21
1
-1
/
+1
*
clean up domain/env detection code
Bryan Newbold
2020-05-21
3
-29
/
+16
*
search query improvements
Bryan Newbold
2020-05-21
5
-145
/
+236
*
less whitespace in jinja2 output
Bryan Newbold
2020-05-21
1
-0
/
+4
*
abstracts as object, not nested, until query parser
Bryan Newbold
2020-05-21
1
-5
/
+3
*
skip pdftotext loading on unicode error
Bryan Newbold
2020-05-20
1
-0
/
+2
*
skip SIM items w/o page_numbers (instead of asserting)
Bryan Newbold
2020-05-20
2
-2
/
+6
*
fewer, longer highlights (2x of 250 chars)
Bryan Newbold
2020-05-20
1
-4
/
+4
*
schema: releases as objects, not nested
Bryan Newbold
2020-05-20
1
-1
/
+1
*
schema: many more aliases
Bryan Newbold
2020-05-20
1
-1
/
+19
*
add a helper tag for search index document
Bryan Newbold
2020-05-20
1
-1
/
+5
*
fix some ext_id links
Bryan Newbold
2020-05-20
1
-4
/
+4
*
fixes from manual testing
Bryan Newbold
2020-05-20
6
-25
/
+33
*
local pdftotext cache dir hack
Bryan Newbold
2020-05-20
2
-1
/
+19
*
fixes to release+sim pipeline
Bryan Newbold
2020-05-20
3
-12
/
+39
*
working docker-compose with elasticsearch (with plugins)
Bryan Newbold
2020-05-20
2
-0
/
+24
*
fixes to schema; actually working now
Bryan Newbold
2020-05-20
1
-3
/
+4
*
default search locations for different environments
Bryan Newbold
2020-05-20
1
-1
/
+4
*
local/dev indexing command
Bryan Newbold
2020-05-20
1
-8
/
+8
*
indexing tweaks
Bryan Newbold
2020-05-20
2
-16
/
+11
*
update search template for schema
Bryan Newbold
2020-05-20
1
-129
/
+95
*
first pass transform from pipelines to ES schema
Bryan Newbold
2020-05-20
6
-27
/
+541
*
WIP on SIM pipeline
Bryan Newbold
2020-05-19
2
-2
/
+175
*
WIP on release-to-sim fetching
Bryan Newbold
2020-05-19
2
-12
/
+124
*
pytest: squelch ABC warning (from internetarchive)
Bryan Newbold
2020-05-16
1
-0
/
+1
*
initial progress on work pipeline
Bryan Newbold
2020-05-16
3
-2
/
+338
*
hack-y global serde ApiClient
Bryan Newbold
2020-05-16
1
-2
/
+4
*
crude djvu XML parsing
Bryan Newbold
2020-05-16
3
-0
/
+5207
*
basic biblio converter
Bryan Newbold
2020-05-16
2
-8
/
+130
*
initial gitlab-ci file
Bryan Newbold
2020-05-16
1
-0
/
+18
*
tweak ES schema fields a bit
Bryan Newbold
2020-05-16
3
-19
/
+32
*
more progress on issue_db
Bryan Newbold
2020-05-16
2
-28
/
+60
*
gitkeep ./data/ directory
Bryan Newbold
2020-05-16
2
-0
/
+2
*
first pass at issue-db tool
Bryan Newbold
2020-05-15
2
-0
/
+321
[prev]
[next]