0

返された HTML は、ブラウザを再起動するように指示し続けており、少し迷っています。

require 'rubygems'
require 'mechanize'

def getHtml(the_url)
  agent = Mechanize.new
  agent.keep_alive = false
  agent.user_agent = "gibsonSim"
  agent.user_agent_alias = "Mechanize"
  agent.redirect_ok = true
  agent.add_auth('www.http://corpus2.byu.edu/','omitted', 'omitted')
  resp = agent.get(the_url)
  puts resp.body
  return resp   
end

url = "http://corpus2.byu.edu/glowbe/x2.asp?     chooser=seq&p=%5Bsolid%5D&w2=&wl=4&wr=4&r1=&r2=&ipos1=-select-&B7=SEARCH&showsec=y&sec1=0&sec2=0&sortBy=freq&sortByDo2=freq&minfreq1=freq&freq1=20&freq2=20&numhits=100&kh=100&groupBy=words&whatshow=raw&saveList=no&changed=&corpus=glowbe&word=&sbs=&sbs1=&sbsreg1=&sbsr=&sbsgroup=&redidID=&ownsearch=y&compared=&holder=&whatdo=seq&rand1=y&whatdo1=1&didRandom=n&minFreq=freq&s1=0&s2=0&s3=0&perc=mi"
puts getHtml(url)

これが Mechanize で毎回発生する理由はよくわかりませんが、Chrome ではたまにしか発生しません。

返される HTML は次のとおりです。

<style>

<!--



option { font-family: Verdana; font-size: 9px }
input { font-family: Verdana; font-size: 9px }
body { font-family: Verdana; font-size: 11px }
div { font-family: Verdana; font-size: 11px }
p { font-family: Verdana; font-size: 11px }
td { font-family: Verdana; font-size: 11px }



-->
</style>

<html>

<head>
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">
<title>New Page 1</title>
<script language=Javascript>

function x(x1)
{
top.lefto.document.zabba.reset();
top.lefto.document.zabba.p.value = x1;
top.lefto.document.zabba.wl.options[0].selected = true;
top.lefto.document.zabba.whatsee[0].checked='true';
top.lefto.document.zabba.submit();
}

function x()
{
top.lefto.document.zabba.submit();
}

</script>

</head>


<body>


<div align="center">
<table align="center" border="0" cellpadding="10" cellspacing="0" style="border-    collapse: collapse" bordercolor="#111111" width="70%" id="AutoNumber1">
<tr><td style="background-color: #FFFFFF">&nbsp;</td></tr>
  <tr>
    <td align="center" width="100%">


Please close your browser <b>completely</b>, and then open your browser and start a new session.


</td>
  </tr>
</table>
<p>&nbsp;</p>
</body>
</html>

あなたが与えることができるどんな助けにも感謝します!

4

1 に答える 1