index
:
sandcrawler
bnewbold-args
bnewbold-backfill
bnewbold-persist-grobid-errors
bnewbold-refactor-loggging
master
trawler
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
Branch
Commit message
Author
Age
master
pytest: skip warning in gwb
Bryan Newbold
23 months
bnewbold-refactor-loggging
WIP: refactor logging calls in ingest pipelines
Bryan Newbold
2 years
trawler
notes on re-GROBID-ing (and re-extracting) some files
Bryan Newbold
3 years
bnewbold-persist-grobid-errors
grobid persist: if status_code is not set, default to 0
Bryan Newbold
5 years
bnewbold-args
make hbase_table and zookeeper_hosts CLI args
Bryan Newbold
6 years
bnewbold-backfill
make hbase_table and zookeeper_hosts CLI args
Bryan Newbold
6 years
Age
Commit message
Author
Files
Lines
2018-06-08
make hbase_table and zookeeper_hosts CLI args
bnewbold-backfill
Bryan Newbold
4
-17
/
+32
2018-06-06
Made test data more robust.
Ellen Spertus
1
-2
/
+2
2018-06-06
Removed copied comment.
Ellen Spertus
1
-8
/
+1
2018-06-06
Added job and test for counting mime types.
Ellen Spertus
2
-0
/
+96
2018-06-05
Made package names match directory names. Cleaned up imports.
Ellen Spertus
4
-16
/
+13
2018-06-04
Merge branch 'refactoring' into 'master'
bnewbold
4
-20
/
+101
2018-06-04
Merge branch 'bnewbold-scala-build-fixes' into 'master'
bnewbold
3
-21
/
+19
2018-06-04
Made changes suggested in merge request review.
Ellen Spertus
3
-15
/
+10
2018-06-04
try to run scala tests in gitlab CI
Bryan Newbold
1
-2
/
+12
2018-06-04
fetch SpyGlass jar from archive.org (not local)
Bryan Newbold
2
-19
/
+7
[...]
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler