获取个人Pocket推荐流的非官方库
项目描述
pocket-recommendations
获取个人Pocket推荐流的非官方库
用法
获取一个公开的Pocket资料,例如 这个。使用Python或其他方式下载其HTML
$ curl "https://getpocket.com/@honzajavorek" > getpocket-com-honzajavorek.html
在您的Python程序中,准备HTML字符串
>>> from pathlib import Path
>>> html_text = Path('getpocket-com-honzajavorek.html').read_text()
现在您可以使用这个库来解析HTML
>>> import pocket_recommendations
>>> items = pocket_recommendations.parse(html_text)
>>> len(items)
50
每个项目看起来像这样
>>> from pprint import pprint
>>> pprint(items[0])
{'pocket_comment': 'Šablona na váš úspěšný HackerNews post',
'pocket_recommended_at': None,
'pocket_url': 'https://getpocket.com/redirect?&url=https%3A%2F%2Fsaagarjha.com%2Fblog%2F2020%2F05%2F10%2Fwhy-we-at-famous-company-switched-to-hyped-technology%2F&h=eff6d8cac22c9b475463d037037b0efdcf44b762c9b0b7913de2104cab5fa67d',
'title': 'Why we at $FAMOUS_COMPANY Switched to $HYPED_TECHNOLOGY',
'url': 'https://saagarjha.com/blog/2020/05/10/why-we-at-famous-company-switched-to-hyped-technology/'}
尽管Pocket使用HTTP链接进行重定向,但库强制使用HTTPS。
推荐日期
您可以通过指定下载HTML的日期来获取推荐发布的相对日期
>>> from datetime import date
>>> items = pocket_recommendations.parse(html_text, today=date(2020, 6, 3))
>>> pprint(items[0])
{'pocket_comment': 'Šablona na váš úspěšný HackerNews post',
'pocket_recommended_at': datetime.date(2020, 6, 2),
'pocket_url': 'https://getpocket.com/redirect?&url=https%3A%2F%2Fsaagarjha.com%2Fblog%2F2020%2F05%2F10%2Fwhy-we-at-famous-company-switched-to-hyped-technology%2F&h=eff6d8cac22c9b475463d037037b0efdcf44b762c9b0b7913de2104cab5fa67d',
'title': 'Why we at $FAMOUS_COMPANY Switched to $HYPED_TECHNOLOGY',
'url': 'https://saagarjha.com/blog/2020/05/10/why-we-at-famous-company-switched-to-hyped-technology/'}
缺失评论
如果没有评论,则设置为 None
>>> from datetime import date
>>> items = pocket_recommendations.parse(html_text)
>>> pprint(items[15])
{'pocket_comment': None,
'pocket_recommended_at': None,
'pocket_url': 'https://getpocket.com/redirect?&url=https%3A%2F%2Falmad.blog%2Fessays%2Fwhat-is-employment%2F&h=ef4216c9df41763fa900b12815a280bf790f50960468a45ebed5f3682156dc6a',
'title': "We Don't Know What an Employment Is",
'url': 'https://almad.blog/essays/what-is-employment/'}
误解释的HTML实体
如果标题包含一些误解释的HTML实体,库会处理它
>>> from datetime import date
>>> items = pocket_recommendations.parse(html_text)
>>> pprint(items[15]) # title: We Don't Know What an Employment Is
{'pocket_comment': None,
'pocket_recommended_at': None,
'pocket_url': 'https://getpocket.com/redirect?&url=https%3A%2F%2Falmad.blog%2Fessays%2Fwhat-is-employment%2F&h=ef4216c9df41763fa900b12815a280bf790f50960468a45ebed5f3682156dc6a',
'title': "We Don't Know What an Employment Is",
'url': 'https://almad.blog/essays/what-is-employment/'}
项目详情
下载文件
下载适用于您平台的应用程序文件。如果您不确定选择哪个,请了解更多关于 安装包 的信息。
源分布
pocket-recommendations-0.1.1.tar.gz (3.7 kB 查看哈希值)
构建分布
关闭
pocket-recommendations-0.1.1.tar.gz 的哈希值
算法 | 哈希摘要 | |
---|---|---|
SHA256 | 61d2e09d83f0d6b9916bebfe8a3de48019156964f97bb1ebb62ad8d4ef71786d |
|
MD5 | 12d732744516515baa29a4f023d9e605 |
|
BLAKE2b-256 | a3efa9b3df1f8589aae3e65f62119472a7f86043705942fd465b2502fb79f5be |
关闭
pocket_recommendations-0.1.1-py3-none-any.whl 的哈希值
算法 | 哈希摘要 | |
---|---|---|
SHA256 | 70f9970245eb12ee6740f568d4fea33ace294c4ab909a51624776faffc7b14d4 |
|
MD5 | 7b52eee6d56b573766cb45eb4ae0d92c |
|
BLAKE2b-256 | 01371344a839f59c13b7200e1ed820cc08e54fa792e7a4b5aac3304986be5e81 |