python - Paste.httpserverとHTTP/1.1Keep-aliveによるスローダウン。httperfとabでテスト済み

Question

HTTPとWSGIの間のアダプターとしてpaste.httpserverに基づくWebサーバーがあります。httperfを使用してパフォーマンス測定を行う場合、-num-connを使用して毎回新しいリクエストを開始すると、1秒あたり1,000を超えるリクエストを実行できます。代わりに--num-callを使用して接続を再利用すると、1秒あたり約11のリクエストが発生し、速度は100分の1になります。

abを試してみると、タイムアウトになります。

私のテストは

% ./httperf --server localhost --port 8080 --num-conn 100
...
Request rate: 1320.4 req/s (0.8 ms/req)
...

と

% ./httperf --server localhost --port 8080 --num-call 100
...
Request rate: 11.2 req/s (89.4 ms/req)
...

これが単純な再現可能なサーバーです

from paste import httpserver

def echo_app(environ, start_response):
    n = 10000
    start_response("200 Ok", [("Content-Type", "text/plain"),
                              ("Content-Length", str(n))])
    return ["*" * n]

httpserver.serve(echo_app, protocol_version="HTTP/1.1")

これはマルチスレッドサーバーであり、プロファイリングが困難です。これがシングルスレッドのバリエーションです。

from paste import httpserver

class MyHandler(httpserver.WSGIHandler):
    sys_version = None
    server_version = "MyServer/0.0"
    protocol_version = "HTTP/1.1"

    def log_request(self, *args, **kwargs):
        pass


def echo_app(environ, start_response):
    n = 10000
    start_response("200 Ok", [("Content-Type", "text/plain"),
                              ("Content-Length", str(n))])
    return ["*" * n]

# WSGIServerBase is single-threaded
server = httpserver.WSGIServerBase(echo_app, ("localhost", 8080), MyHandler)
server.handle_request()

それをプロファイリングする

% python2.6 -m cProfile -o paste.prof paste_slowdown.py

とそれを打つ

%httperf --client=0/1 --server=localhost --port=8080 --uri=/ \ 
   --send-buffer=4096 --recv-buffer=16384 --num-conns=1 --num-calls=500

私は次のようなプロファイルを取得します

>>> p=pstats.Stats("paste.prof")
>>> p.strip_dirs().sort_stats("cumulative").print_stats()
Sun Nov 22 21:31:57 2009    paste.prof

         109749 function calls in 46.570 CPU seconds

   Ordered by: cumulative time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.000    0.000   46.571   46.571 {execfile}
        1    0.001    0.001   46.570   46.570 paste_slowdown.py:2(<module>)
        1    0.000    0.000   46.115   46.115 SocketServer.py:250(handle_request)
        1    0.000    0.000   44.675   44.675 SocketServer.py:268(_handle_request_noblock)
        1    0.000    0.000   44.675   44.675 SocketServer.py:301(process_request)
        1    0.000    0.000   44.675   44.675 SocketServer.py:318(finish_request)
        1    0.000    0.000   44.675   44.675 SocketServer.py:609(__init__)
        1    0.000    0.000   44.675   44.675 httpserver.py:456(handle)
        1    0.001    0.001   44.675   44.675 BaseHTTPServer.py:325(handle)
      501    0.006    0.000   44.674    0.089 httpserver.py:440(handle_one_request)
     2001    0.020    0.000   44.383    0.022 socket.py:373(readline)
      501   44.354    0.089   44.354    0.089 {method 'recv' of '_socket.socket' objects}
        1    1.440    1.440    1.440    1.440 {select.select}
         ....

ほぼ常にrecvにあることがわかります。

私はhttprefを利用して、独自のHTTP / 1.1-with-keep-aliveリクエストを作成し、netcatを使用して送信することにしました。

GET / HTTP/1.1
Location: localhost
Connection: Keep-Alive
Content-Length: 0

GET / HTTP/1.1
Location: localhost
Connection: Keep-Alive
Content-Length: 0

 ... repeat 97 more times, to have 99 keep-alives in total ...

GET / HTTP/1.1
Location: localhost
Connection: Close
Content-Length: 0

一緒に送った

nc localhost 8080 < ~/src/send_to_paste.txt

100リクエストの合計時間は0.03秒だったので、非常に優れたパフォーマンスです。

これは、httperfが何か間違ったことをしていることを示唆しています（ただし、広く使用され、尊敬されているコードです）。そこで、「ab」を試しました。

% ab -n 100 -k localhost:8080/
This is ApacheBench, Version 1.3d <$Revision: 1.73 $> apache-1.3
Copyright (c) 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Copyright (c) 2006 The Apache Software Foundation, http://www.apache.org/

Benchmarking localhost (be patient)...
Server timed out

: Operation now in progress

サーバーをインストルメント化して、1つの要求を処理し、2番目の要求を待機します。

何が起こっているのかについて何か考えはありますか？

score 6 · Accepted Answer

いくつかの努力の後、それはNagle のアルゴリズムまたは遅延 ACK、またはそれらの間の相互作用のいずれかのようです。みたいなことしたら消える

server.socket.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)

どうやって追跡したの？まず、socket.py 内のすべての「recv」を計測して、どの recv が待機しているかを把握できるようにしました。11 回のうち約 5 回の受信で、ほぼ 200 ミリ秒の遅延があったことがわかります。なぜ遅延が発生したのかわかりませんでした。次に、Wireshark を使用してメッセージを監視したところ、実際にはサーバーからクライアントへの送信で遅延が発生していることに気付きました。これは、クライアントからの送信メッセージの TCP 層に何かが含まれていることを意味していました。

友人が明らかなことを提案してくれたので、「200ms ソケット遅延」を検索したところ、この問題の説明が見つかりました。

ペーストトラックレポートはhttp://trac.pythonpaste.org/pythonpaste/ticket/392にあり、ハンドラーが HTTP/1.1 を使用する場合に TCP_NODELAY を有効にするパッチがあります。

python - Paste.httpserverとHTTP/1.1Keep-aliveによるスローダウン。httperfとabでテスト済み

1 に答える 1

Related

Reference