java - EC2上のJGroupsノードは、お互いを見ているのに話をしていません

Question

Hibernate Searchを使用して、jgroupsSlaveノードからLuceneインデックスへのすべての書き込みがjgroupsMasterノードに送信され、LuceneインデックスがInfinispanでスレーブに共有されるようにしようとしています。すべてがローカルで機能しますが、ノードはEC2でお互いを検出しますが、通信していないようです。

彼らは両方ともお互いに生きているメッセージを送っています。

# master output sample
86522 [LockBreakingService,localCache,archlinux-37498] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
86523 [LockBreakingService,LuceneIndexesLocking,archlinux-37498] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
87449 [Timer-4,luceneCluster,archlinux-37498] DEBUG org.jgroups.protocols.FD  - sending are-you-alive msg to archlinux-57950 (own address=archlinux-37498)
87522 [LockBreakingService,localCache,archlinux-37498] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
87523 [LockBreakingService,LuceneIndexesLocking,archlinux-37498] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0

# slave output sample
85499 [LockBreakingService,localCache,archlinux-57950] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
85503 [LockBreakingService,LuceneIndexesLocking,archlinux-57950] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
86190 [Timer-3,luceneCluster,archlinux-57950] DEBUG org.jgroups.protocols.FD  - sending are-you-alive msg to archlinux-37498 (own address=archlinux-57950)
86499 [LockBreakingService,localCache,archlinux-57950] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0
86503 [LockBreakingService,LuceneIndexesLocking,archlinux-57950] DEBUG org.infinispan.transaction.TransactionTable  - About to cleanup completed transaction. Initial size is 0

セキュリティグループ

マスター用とスレーブ用の2つのjarがあり、それぞれのEC2インスタンスで実行しています。各インスタンスに他のインスタンスからpingを実行できます。これらは両方とも同じセキュリティグループに属しており、グループ内の任意のマシン間の通信に関する次のルールを定義しています。

ICMPのすべてのポート0-65535（TCPの場合）0-65535（UDPの場合）

したがって、セキュリティグループの構成の問題ではないと思います。

hibernate.properties

# there is also a corresponding jgroupsSlave
hibernate.search.default.worker.backend=jgroupsMaster
hibernate.search.default.directory_provider = infinispan
hibernate.search.infinispan.configuration_resourcename=infinispan.xml
hibernate.search.default.data_cachename=localCache
hibernate.search.default.metadata_cachename=localCache

infinispan.xml

<infinispan xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
            xsi:schemaLocation="urn:infinispan:config:5.1 http://www.infinispan.org/schemas/infinispan-config-5.1.xsd"
            xmlns="urn:infinispan:config:5.1">
    <global>
        <transport clusterName="luceneCluster" transportClass="org.infinispan.remoting.transport.jgroups.JGroupsTransport">
            <properties>
                <property name="configurationFile" value="jgroups-ec2.xml" />
            </properties>
        </transport>
    </global>

    <default>
        <invocationBatching enabled="true" />
        <clustering mode="repl">

        </clustering>
    </default>

    <!-- this is just so that each machine doesn't have to store the index
         in memory -->
    <namedCache name="localCache">
        <loaders passivation="false" preload="true" shared="false">
            <loader class="org.infinispan.loaders.file.FileCacheStore" fetchPersistentState="true" ignoreModifications="false" purgeOnStartup="false">
                <properties>
                    <property name="location" value="/tmp/infinspan/master" />
                    <!-- there is a corresponding /tmp/infinispan/slave in
                    the slave config -->
                </properties>
            </loader>
        </loaders>
    </namedCache>
</infinispan>

jgroups-ec2.xml

<config xmlns="urn:org:jgroups" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="urn:org:jgroups http://www.jgroups.org/schema/JGroups-3.2.xsd">
    <TCP
            bind_addr="${jgroups.tcp.address:127.0.0.1}"
            bind_port="${jgroups.tcp.port:7800}"
            loopback="true"
            port_range="30"
            recv_buf_size="20000000"
            send_buf_size="640000"
            max_bundle_size="64000"
            max_bundle_timeout="30"
            enable_bundling="true"
            use_send_queues="true"
            sock_conn_timeout="300"
            enable_diagnostics="false"

            bundler_type="old"

            thread_pool.enabled="true"
            thread_pool.min_threads="2"
            thread_pool.max_threads="30"
            thread_pool.keep_alive_time="60000"
            thread_pool.queue_enabled="false"
            thread_pool.queue_max_size="100"
            thread_pool.rejection_policy="Discard"

            oob_thread_pool.enabled="true"
            oob_thread_pool.min_threads="2"
            oob_thread_pool.max_threads="30"
            oob_thread_pool.keep_alive_time="60000"
            oob_thread_pool.queue_enabled="false"
            oob_thread_pool.queue_max_size="100"
            oob_thread_pool.rejection_policy="Discard"
            />
    <S3_PING secret_access_key="removed_for_stackoverflow" access_key="removed_for_stackoverflow" location="jgroups_ping" />

    <MERGE2 max_interval="30000"
            min_interval="10000"/>
    <FD_SOCK/>
    <FD timeout="3000" max_tries="3"/>
    <VERIFY_SUSPECT timeout="1500"/>
    <pbcast.NAKACK2
            use_mcast_xmit="false"
            xmit_interval="1000"
            xmit_table_num_rows="100"
            xmit_table_msgs_per_row="10000"
            xmit_table_max_compaction_time="10000"
            max_msg_batch_size="100"
            become_server_queue_size="0"/>
    <UNICAST2
            max_bytes="20M"
            xmit_table_num_rows="20"
            xmit_table_msgs_per_row="10000"
            xmit_table_max_compaction_time="10000"
            max_msg_batch_size="100"/>
    <RSVP />
    <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
                   max_bytes="400000"/>
    <pbcast.GMS print_local_addr="false" join_timeout="7000" view_bundling="true"/>
    <UFC max_credits="2000000" min_threshold="0.10"/>
    <MFC max_credits="2000000" min_threshold="0.10"/>
    <FRAG2 frag_size="60000"/>
</config>

これを最新のinfinispan-coreディストリビューション（5.2.0.Beta3ですが、5.1.4も試してみました）から直接コピーしました。私が変更したのは、s3_pingを私のものに置き換えただけですが、ノードがs3に書き込んでいるのがわかり、ノードがお互いを見つけているので、それは問題ではないと思います。また、jgroups.tcp.addressの環境変数をプライベートIPアドレスに設定してマスター/スレーブを起動しています。また、大幅に簡略化されたいくつかの構成を試しましたが、成功しませんでした。

問題が何であるかについてのアイデアはありますか？私はそれで遊んで数日を過ごしました、そしてそれは私を夢中にさせています。ローカルで動作し、EC2で通信できないため、jgroups設定を使用したものである必要があると思います。

あなたたちがこれを理解するのを手伝いたい他の情報はありますか？

score 6 · Accepted Answer

2 つの JGroups チャネルが開始されているため、2 つの JGroups 構成を指定する必要があります。1 つは Infinispan 用で、もう 1 つはバックエンドワーカー通信用です。

Infinispan とjgroupsMasterはどちらも、指定しない限りデフォルトの構成設定を使用しますが、デフォルトでは EC2 では機能しないマルチキャストが使用されます。

Infinispan インデックスの設定が正しく設定されているようですが、S3_PING または JDBC_PING も使用するようにjgroupsMasterワーカーを再設定する必要があります。デフォルト設定ではマルチキャストを使用してピアを自動検出できるため、ローカルで機能している可能性があります。

この重複はHSEARCH-882によって解決されます。構成が大幅に簡素化されることを楽しみにしています。

java - EC2上のJGroupsノードは、お互いを見ているのに話をしていません

1 に答える 1

Related