string - 多くの整数を含む文字配列のatoi

Question

文字配列に整数が入力され (char 配列に変換され)、それを整数に再変換する別の関数によって読み取られるコードがあります。次の関数を使用して、char 配列への変換を取得しました。

char data[64];
int a = 10;
std::string str = boost::lexical_cast<std::string>(a);
memcpy(data + 8*k,str.c_str(),sizeof(str.c_str()));   //k varies from 0 to 7

文字への再変換は、次を使用して行われます。

char temp[8];
memcpy(temp,data+8*k,8);
int a = atoi(temp);

これは一般的には問題なく動作しますが、qt (ver 4.7) を含むプロジェクトの一部としてこれを実行しようとすると、正常にコンパイルされ、memcpy() を使用して読み取ろうとするとセグメンテーションエラーが発生します。セグメンテーション違反は、データの書き込み中ではなく、読み取りループ中にのみ発生することに注意してください。なぜこれが起こるのかわかりませんが、何らかの方法でそれを成し遂げたいと思っています。

それで、文字配列、最初のビットと最後のビットを取り、それを整数に変換できる、私が使用できる他の関数はありますか？その後、 memcpy() を使用する必要はまったくありません。私がやろうとしていることは、次のようなものです。

new_atoi(data,8*k,8*(k+1)); // k varies from 0 to 7

前もって感謝します。

score 9 · Accepted Answer

4 文字のみをコピーしています (システムのポインター幅によって異なります)。これにより、null 以外で終了する 4 文字以上の文字数が残り、atoi への入力でランナウェイ文字列が発生します。

 sizeof(str.c_str()) //i.e. sizeof(char*) = 4 (32 bit systems)

する必要があります

 str.length() + 1

または、文字は nullterminate されません

STL のみ:

^{make_testdata(): ずっと下を見る}

ストリームを使用しないのはなぜですか...?

#include <sstream>
#include <iostream>
#include <algorithm>
#include <iterator>
#include <string>
#include <vector>

int main()
{
    std::vector<int> data = make_testdata();

    std::ostringstream oss;
    std::copy(data.begin(), data.end(), std::ostream_iterator<int>(oss, "\t"));

    std::stringstream iss(oss.str());

    std::vector<int> clone;
    std::copy(std::istream_iterator<int>(iss), std::istream_iterator<int>(),
              std::back_inserter(clone));

    //verify that clone now contains the original random data:
    //bool ok = std::equal(data.begin(), data.end(), clone.begin());

    return 0;
}

単純な C では、atoi/itoa といくつかの微調整を使用して、はるかに高速に実行できますが、速度が必要な場合は、バイナリ転送を使用する必要があると思います (優れたライブラリについては、 Boost Spirit Karmaとprotobufを参照してください)。

カルマ/気のブースト:

#include <boost/spirit/include/qi.hpp>
#include <boost/spirit/include/karma.hpp>

namespace qi=::boost::spirit::qi;
namespace karma=::boost::spirit::karma;

static const char delimiter = '\0';

int main()
{
    std::vector<int> data = make_testdata();

    std::string astext;
//  astext.reserve(3 * sizeof(data[0]) * data.size()); // heuristic pre-alloc
    std::back_insert_iterator<std::string> out(astext);

    {
        using namespace karma;
        generate(out, delimit(delimiter) [ *int_ ], data);
    //  generate_delimited(out, *int_, delimiter, data); // equivalent
    //  generate(out, int_ % delimiter, data); // somehow much slower!
    }

    std::string::const_iterator begin(astext.begin()), end(astext.end());
    std::vector<int> clone;
    qi::parse(begin, end, qi::int_ % delimiter, clone);

    //verify that clone now contains the original random data:
    //bool ok = std::equal(data.begin(), data.end(), clone.begin());

    return 0;
}

代わりに、アーキテクチャに依存しないバイナリシリアライゼーションを行いたい場合は、この小さな適応を使用して、物事~~を無数倍~~高速化します (以下のベンチマークを参照してください... )。

karma::generate(out, *karma::big_dword, data);
// ...
qi::parse(begin, end, *qi::big_dword, clone);

シリアライゼーションのブースト

バイナリモードでブーストシリアライゼーションを使用すると、最高のパフォーマンスが得られます。

#include <sstream>
#include <boost/archive/binary_oarchive.hpp>
#include <boost/archive/binary_iarchive.hpp>
#include <boost/serialization/vector.hpp>

int main()
{
    std::vector<int> data = make_testdata();

    std::stringstream ss;
    {
        boost::archive::binary_oarchive oa(ss);
        oa << data;
    }

    std::vector<int> clone;
    {
        boost::archive::binary_iarchive ia(ss);
        ia >> clone;
    }

    //verify that clone now contains the original random data:
    //bool ok = std::equal(data.begin(), data.end(), clone.begin());

    return 0;
}

テストデータ

(上記全バージョン共通)

#include <boost/random.hpp>

// generates a deterministic pseudo-random vector of 32Mio ints
std::vector<int> make_testdata()
{
    std::vector<int> testdata;

    testdata.resize(2 << 24);
    std::generate(testdata.begin(), testdata.end(), boost::mt19937(0));

    return testdata;
}

ベンチマーク

私はそれをベンチマークしました

2<<24(33554432) ランダム整数の入力データを使用
出力を表示しない (端末のスクロールパフォーマンスを測定したくない)
大まかなタイミングは
- STL のみのバージョンは実際には 12.6 秒でそれほど悪くはありません
- Arlen のヒントのおかげで、Karma/Qi テキストバージョンは ~~18 秒~~ 5.1秒で実行されました:)generate_delimited
- Karma/Qi バイナリバージョン (big_dword) はわずか 1.4 秒 (約~~12 倍~~、3 ～ 4 倍の速さ)
- Boost Serialization は約 0.8 秒 (バイナリの代わりにテキストアーカイブを使用する場合は約 13 秒) でケーキを完成させます。

score 2 · Accepted Answer

Karma/Qi テキストバージョンが STL バージョンより遅くなる理由はまったくありません。その主張を反映するために、Karma/Qi テキストバージョンの @sehe 実装を改善しました。

次のBoost Karma/Qiテキストバージョンは、STL バージョンの2 倍以上高速です。

#include <boost/spirit/include/qi.hpp>
#include <boost/spirit/include/karma.hpp>
#include <boost/random.hpp>
#include <boost/spirit/include/phoenix_core.hpp>
#include <boost/spirit/include/phoenix_operator.hpp>
#include <boost/spirit/include/phoenix_stl.hpp>

namespace ascii = boost::spirit::ascii;
namespace qi = boost::spirit::qi;
namespace karma = boost::spirit::karma;
namespace phoenix = boost::phoenix;


template <typename OutputIterator>
void generate_numbers(OutputIterator& sink, const std::vector<int>& v){

  using karma::int_;
  using karma::generate_delimited;
  using ascii::space;

  generate_delimited(sink, *int_, space, v);
}

template <typename Iterator>
void parse_numbers(Iterator first, Iterator last, std::vector<int>& v){

  using qi::int_;
  using qi::phrase_parse;
  using ascii::space;
  using qi::_1;
  using phoenix::push_back;
  using phoenix::ref;

  phrase_parse(first, last, *int_[push_back(ref(v), _1)], space);
}

int main(int argc, char* argv[]){

  static boost::mt19937 rng(0); // make test deterministic
  std::vector<int> data;
  data.resize(2 << 24);
  std::generate(data.begin(), data.end(), rng);

  std::string astext;
  std::back_insert_iterator<std::string> out(astext);
  generate_numbers(out, data);

  //std::cout << astext << std::endl;

  std::string::const_iterator begin(astext.begin()), end(astext.end());
  std::vector<int> clone;
  parse_numbers(begin, end, clone);

  //verify that clone now contains the original random data:
  //std::copy(clone.begin(), clone.end(), std::ostream_iterator<int>(std::cout, ","));

  return 0;
}

string - 多くの整数を含む文字配列のatoi

2 に答える 2

STL のみ:

カルマ/気のブースト:

シリアライゼーションのブースト

テストデータ

ベンチマーク

Related

Reference