python - PythonでWebからhtml入力値を取得する方法

Question

アクセスするwww.sampleweb.com/reg/と、のような入力値があります。

<input id="input-id" class="input-class" name="myinput" type="text" value="hello world">

Python を使用しての入力のhello world値を取得するにはどうすればよいですか?www.sampleweb.com/reg/

へのアクセス www.sampleweb.com/reg/は次のようになると思います：

url = 'http://www.sampleweb.com/reg/'
urlopen(url)

URLにアクセスするのはこれで正しいですか？

誰かが私のケースについて私を助けることができますか?

前もって感謝します ...

score 1 · Accepted Answer

python htmlパーサーを使用して、(前述のように)urllib経由でgwettingした後、htmlを解析する必要があります。たとえば、BeautifulSoup を使用する場合: http://www.crummy.com/software/BeautifulSoup/bs3/documentation.html#find%28name,%20attrs,%20recursive,%20text,%20 **kwargs%29

あなたの場合、次のようなものです：

soup = BeautifulSoup(html)
inputs=soup.find("input", {"id": "input-id"})
print inputs[0]['value']

score 1 · Accepted Answer

1

BeautifulSoupというライブラリを使用できます

于 2012-06-01T06:37:27.647 に答える

score 0 · Accepted Answer

リソースの HTML を解析するには、 DOM パーサーを使用するのが最適なオプションであることに注意してください。

ただし、「hello world」だけが HTML に必要な場合は、次のような手っ取り早い方法があります。

toFind = '<input id="input-id" class="input-class" name="myinput" type="text" value="'
htmlStr = urllib.urlopen('yoururl.com/your/path').read()
value = htmlStr[htmlStr.index(toFind)+len(toFind):]
value = htmlStr[:htmlStr.index('\"')]
print value

python - PythonでWebからhtml入力値を取得する方法

3 に答える 3

Related

Reference