diff options
author | Bryan Newbold <bnewbold@robocracy.org> | 2020-03-28 20:01:46 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2020-03-28 20:01:48 -0700 |
commit | d6af7b7544ddb3b5e7b1f4a0fd76bd9cd5ed9125 (patch) | |
tree | 924f23ae748b94d8604a0e92f952ddf9562a5c93 /python/fatcat_ingest.py | |
parent | 4b75a81cbd0faeefa6a0f04b97ecc6832924ee69 (diff) | |
download | fatcat-d6af7b7544ddb3b5e7b1f4a0fd76bd9cd5ed9125.tar.gz fatcat-d6af7b7544ddb3b5e7b1f4a0fd76bd9cd5ed9125.zip |
pubmed: bunch of .get_text() instead of .string
Yikes! Apparently when a tag has child tags, .string will return None
instead of all the strings. .get_text() returns all of it:
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#get-text
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#string
I've things like identifiers as .string, when we expect only a single
string inside.
Diffstat (limited to 'python/fatcat_ingest.py')
0 files changed, 0 insertions, 0 deletions