0

データを使用して URL を書き込み、Web サイトからデータをスクレイピングしようとしています。私の機能は github にあります: https://github.com/blaquans/RInsee/blob/master/R/inflation.R

小さい数では問題なく動作しますが、大きな数では機能しません:

> inflation(input = 4000, input.currency = "Euro", input.year = 2004, output.currency = "Euro", output.year = 2012)
http://www.insee.fr/fr/themes/calcul-pouvoir-achat.asp?sommeDepart=4000&deviseDepart=Euro&anneeDepart=2004&deviseArrivee=Euro&anneeArrivee=2012
[1] 4569.57
> inflation(input = 400000, input.currency = "Euro", input.year = 2004, output.currency = "Euro", output.year = 2012)
ERROR 
http://www.insee.fr/fr/themes/calcul-pouvoir-achat.asp?sommeDepart=4e+05&deviseDepart=Euro&anneeDepart=2004&deviseArrivee=Euro&anneeArrivee=2012
[1] NA

その理由は、R が 4e+06 で 400000 を変換し、適切な URL を書き込まないためです。R に 4e+06 の代わりに 400000 を強制的に書き込むにはどうすればよいですか?

4

1 に答える 1

1

使用するoptions(scipen =)

inflation <- function(input, input.currency, input.year, output.currency, output.year) {
  oldscipen <- options('scipen')$scipen
  options(scipen = 999)

  require("RCurl")
  tx <- getURL(paste("http://www.insee.fr/fr/themes/calcul-pouvoir-achat.asp?sommeDepart=",input, "&deviseDepart=",input.currency,"&anneeDepart=",input.year, "&deviseArrivee=",output.currency, "&anneeArrivee=",output.year, sep = ""))
  patrick <- ".*<strong class=\"resultat\">([[:digit:][:blank:],]+)[[:blank:]eurosfracin]+</strong>.*"
  if (grepl(pattern = patrick, x = tx) == TRUE){
    out <- sub(pattern = patrick , replacement = "\\1", x = tx)
    out <- as.numeric(gsub( pattern = "[[:blank:]]", replacement = "", x = gsub(pattern = ",", replacement = ".", x = out)))
    }
  else {
    cat("ERROR \n")
    out <- NA
    }
  options(scipen = oldscipen)

  return(out)
  }

あなたの例をテストする:

> inflation(input = 4000, input.currency = "Euro", input.year = 2004, output.currency = "Euro", output.year = 2012)
Loading required package: RCurl
Loading required package: bitops
[1] 4569.57
> 
> inflation(input = 400000, input.currency = "Euro", input.year = 2004, output.currency = "Euro", output.year = 2012)

[1] 456956.5
于 2013-06-14T11:23:24.480 に答える