index
:
sandcrawler
bnewbold-args
bnewbold-backfill
bnewbold-persist-grobid-errors
bnewbold-refactor-loggging
master
trawler
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
scalding
/
src
/
test
/
scala
/
sandcrawler
/
ScoreJobTest.scala
Commit message (
Expand
)
Author
Age
Files
Lines
*
set a minimum slug size (8 chars)
Bryan Newbold
2018-08-23
1
-9
/
+16
*
Fixed style violations.
Ellen Spertus
2018-08-22
1
-2
/
+1
*
Added ScoreJob test for title-length filtering.
Ellen Spertus
2018-08-22
1
-5
/
+13
*
Merge branch 'bnewbold-match-scale'
Bryan Newbold
2018-08-21
1
-0
/
+2
|
\
|
*
add a trap to ScoreJob
Bryan Newbold
2018-08-20
1
-0
/
+2
*
|
Merge branch 'strings'
Bryan Newbold
2018-08-21
1
-0
/
+1
|
\
\
|
*
|
Reads blacklist from file.
Ellen Spertus
2018-08-20
1
-0
/
+1
|
|
/
*
/
Disabled scalastyle null checking where we want to test null values.
Ellen Spertus
2018-08-20
1
-0
/
+2
|
/
*
change slugification behavior to not split on colon
Bryan Newbold
2018-08-15
1
-16
/
+16
*
handle null status_code lines
Bryan Newbold
2018-08-15
1
-3
/
+7
*
grobid scoring: status_code as signed int, not string
Bryan Newbold
2018-08-15
1
-2
/
+3
*
Fixed style problems (or disabled warning when appropriate) for tests.
Ellen Spertus
2018-08-14
1
-45
/
+52
*
Minor improvements.
Ellen Spertus
2018-08-14
1
-10
/
+7
*
Now ignores grobid entries with status other than 200.
Ellen Spertus
2018-08-14
1
-16
/
+31
*
Pipeline works, all tests pass, no scalastyle errors.
Ellen Spertus
2018-08-13
1
-30
/
+50
*
Snapshot before changing Scorable to find bug.
Ellen Spertus
2018-08-12
1
-5
/
+10
*
Tests pass. Still have changes to do but made huge progress.
Ellen Spertus
2018-08-10
1
-1
/
+1
*
WIP
Ellen Spertus
2018-08-09
1
-0
/
+177