index
:
fatcat
bnewbold-doaj-article-harvest
bnewbold-elastic-extras
bnewbold-openapi-client-generator-v601
bnewbold-pythonclient-types
bnewbold-redoc
bnewbold-rust-gen-v5
bnewbold-sitemap
bnewbold-ubuntu-jammy
cockroach
confluent-kafka
master
preview
x-attic-auth-other-macaroon-lib
x-attic-camp
x-attic-changelog-export
x-attic-chocula
x-attic-cockroach
x-attic-golang
x-attic-more-importers
x-attic-preview
x-attic-python-rust-hacks
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
extra
Commit message (
Expand
)
Author
Age
Files
Lines
*
sql table size script: shorter output
Bryan Newbold
2020-01-15
1
-0
/
+1
*
2019-01-07 status update
Bryan Newbold
2020-01-07
2
-0
/
+36
*
DB loads take a long time now
Bryan Newbold
2019-12-21
1
-1
/
+1
*
add 2019-12-20 stats
Bryan Newbold
2019-12-20
2
-0
/
+148
*
add kafka-pixy to docker-compose file
Bryan Newbold
2019-12-10
1
-0
/
+8
*
tweaks to docker-compose image
Bryan Newbold
2019-12-10
1
-0
/
+5
*
increase max.message.bytes in container
Martin Czygan
2019-12-05
1
-0
/
+1
*
export raw affiliation strings for analysis
Bryan Newbold
2019-10-03
1
-0
/
+17
*
docker-compose: kafka 2.0, and -dev topic names
Bryan Newbold
2019-09-20
1
-3
/
+2
*
document release publish process
v0.3.1
Bryan Newbold
2019-09-18
1
-0
/
+48
*
create new collection just for fatcat exports
Bryan Newbold
2019-09-09
1
-1
/
+1
*
update more rust library name refs
Bryan Newbold
2019-09-05
1
-4
/
+4
*
update all other mentions of python client lib
Bryan Newbold
2019-09-05
3
-9
/
+9
*
sql_dumps: typo
Bryan Newbold
2019-07-14
1
-1
/
+1
*
more fixup notes (from QA server)
Bryan Newbold
2019-06-27
1
-5
/
+46
*
finish fixup_longtail_issnl_unique; but not going to run it
Bryan Newbold
2019-06-27
1
-4
/
+3
*
initial work on longtail_issnl_unique.py
Bryan Newbold
2019-06-24
1
-0
/
+192
*
stats.json update after releases v03 cut-over
Bryan Newbold
2019-06-06
1
-0
/
+1
*
elasticsearch index alias howto
Bryan Newbold
2019-06-06
1
-1
/
+16
*
QA checks (for hash, extid duplication)
Bryan Newbold
2019-06-04
4
-0
/
+82
*
recent prod table sizes; 380 GBytes or so total
Bryan Newbold
2019-06-04
1
-0
/
+233
*
dump_release_extid.sql changes for new schema
Bryan Newbold
2019-06-03
1
-1
/
+1
*
move export README info to sql_dumps doc
Bryan Newbold
2019-06-03
1
-1
/
+29
*
fix parse_merge_metadata.py merge_spans()
Bryan Newbold
2019-05-30
1
-4
/
+8
*
better KBART merging
Bryan Newbold
2019-05-30
1
-4
/
+5
*
initial code to handle multiple KBART spans better
Bryan Newbold
2019-05-30
1
-2
/
+64
*
add work-in-progress elastic index notes
Bryan Newbold
2019-05-30
1
-0
/
+11
*
add 'superceded' release extra flag to elastic schema
Bryan Newbold
2019-05-23
1
-0
/
+1
*
also track work_id in release elasticsearch table
Bryan Newbold
2019-05-22
1
-0
/
+1
*
count linked refs (not just raw refs) in elasticsearch
Bryan Newbold
2019-05-22
1
-0
/
+1
*
commit SQL table stats scripts
Bryan Newbold
2019-05-21
2
-0
/
+36
*
include creator_ids in release elastic schema
Bryan Newbold
2019-05-20
1
-0
/
+1
*
elastic release schema update
Bryan Newbold
2019-05-20
1
-1
/
+6
*
start tracking stats
Bryan Newbold
2019-05-07
2
-0
/
+2
*
IA collection page embed example description
Bryan Newbold
2019-05-07
1
-0
/
+45
*
old fileset and webcapture example entities
Bryan Newbold
2019-04-30
2
-0
/
+146
*
no-derive metadata and SQL dump uploads (to petabox)
Bryan Newbold
2019-04-30
1
-0
/
+2
*
faster elasticsearch imports
Bryan Newbold
2019-04-30
1
-1
/
+1
*
more bots to bootstrap
Bryan Newbold
2019-04-24
1
-0
/
+15
*
update sql dump README
Bryan Newbold
2019-04-24
1
-9
/
+12
*
fix wild elastic schema typo
Bryan Newbold
2019-04-12
1
-1
/
+1
*
record webcaptures added as demos
Bryan Newbold
2019-03-19
1
-0
/
+45
*
new importer: wayback_static
Bryan Newbold
2019-03-19
1
-203
/
+0
*
update enrich examples demo script
Bryan Newbold
2019-03-19
1
-49
/
+63
*
initial wayback-to-webcapture helper
Bryan Newbold
2019-03-19
1
-0
/
+203
*
more integration of transform refactor
Bryan Newbold
2019-03-11
1
-2
/
+2
*
elastic schema indentation
Bryan Newbold
2019-03-06
1
-6
/
+6
*
gitignore SQL identifier dumps
Bryan Newbold
2019-02-22
1
-0
/
+1
*
include container_id in release ES schema
Bryan Newbold
2019-02-22
1
-0
/
+1
*
update ISSN-L file
Bryan Newbold
2019-02-20
2
-2
/
+6
[next]