I am trying to get all the video links from a web page, along with each video's name. I tried the code below.
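#!/usr/bin/python3
from bs4 import BeautifulSoup
import urllib.request  # "import urllib" alone does not reliably expose urllib.request

# Download the raw HTML of the videos page
html = urllib.request.urlopen('https://www.ansible.com/resources/videos').read()
acc_link = BeautifulSoup(html, features="lxml")
# Print the href of every <a> tag on the page
for line in acc_link.find_all('a'):
    print(line.get('href'))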
Output:
https://www.ansible.com/?hsLang=en-us
https://www.ansible.com/overview/it-automation?hsLang=en-us
https://www.ansible.com/overview/it-automation?hsLang=en-us
https://www.ansible.com/overview/how-ansible-works?hsLang=en-us
https://www.ansible.com/products/automation-platform?hsLang=en-us
https://www.ansible.com/use-cases?hsLang=en-us
https://www.ansible.com/use-cases/provisioning?hsLang=en-us
https://www.ansible.com/use-cases/configuration-management?hsLang=en-us
https://www.ansible.com/use-cases/application-deployment?hsLang=en-us
https://www.ansible.com/use-cases/continuous-delivery?hsLang=en-us
https://www.ansible.com/use-cases/security-automation?hsLang=en-us
https://www.ansible.com/use-cases/orchestration?hsLang=en-us
https://www.ansible.com/integrations?hsLang=en-us
Example HTML source:
<h4><a href="https://www.ansible.com/resources/webinars-training/ansible-network-automation-with-arista-cloudvision-and-arista?hsLang=en-us">Ansible Network Automation with Arista CloudVision and Arista Validated Designs</a></h4>
The line above is just one example from the HTML source of https://www.ansible.com/resources/videos. For it, I want the link name to be https://www.ansible.com/resources/webinars-training/ansible-network-automation-with-arista-cloudvision-and-arista and the video name to be Ansible Network Automation with Arista CloudVision and Arista Validated Designs.
Below is another example, where I want the href value up to (but not including) the ? and the text of the a tag, Scale-out Clustering with Tower 3.1.
<h4><a href="https://www.ansible.com/scale-out-clustering-tower?hsLang=en-us">Scale-out Clustering with Tower 3.1</a></h4>
Desired output:
Video Name: Ansible Network Automation with Arista CloudVision and Arista Validated Designs
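One way the extraction described above might be sketched is shown here. It assumes, based only on the two HTML examples in this post, that every video link is an a tag nested inside an h4 tag. It uses only the Python standard library so that it runs without extra packages. The names HeadingLinkParser, extract_video_links and sample_html are hypothetical, and the "Use Cases" link text is made-up filler to show that links outside h4 are skipped. With BeautifulSoup, the same selection could likely be expressed as acc_link.select('h4 > a').

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit, urlunsplit


class HeadingLinkParser(HTMLParser):
    """Collect (url, text) pairs for <a href=...> tags that sit inside <h4> tags."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (url, name) pairs
        self._in_h4 = False  # True while we are between <h4> and </h4>
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "h4":
            self._in_h4 = True
        elif tag == "a" and self._in_h4:
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Keep scheme, host and path; drop the query string after "?"
            parts = urlsplit(self._href)
            clean = urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))
            self.links.append((clean, "".join(self._text).strip()))
            self._href = None
        elif tag == "h4":
            self._in_h4 = False


def extract_video_links(html):
    """Return a list of (url_without_query, link_text) for h4 > a links."""
    parser = HeadingLinkParser()
    parser.feed(html)
    parser.close()
    return parser.links


# Assumption: a small stand-in for the downloaded page, built from the examples above.
sample_html = """
<a href="https://www.ansible.com/use-cases?hsLang=en-us">Use Cases</a>
<h4><a href="https://www.ansible.com/scale-out-clustering-tower?hsLang=en-us">Scale-out Clustering with Tower 3.1</a></h4>
"""

for url, name in extract_video_links(sample_html):
    print("Link Name:", url)
    print("Video Name:", name)
```

To run this against the live page, the decoded HTML downloaded with urllib.request.urlopen could be passed to extract_video_links in place of sample_html. Whether the real page consistently uses h4 for video titles has not been checked here.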
Thanks in advance for any help.