python - numpyで2つの配列を内部結合するにはどうすればよいですか?

Question

次のような 2 つの配列があります。

A:

20131010 123 12321 12312312
20131011 123 12321 12312312
20131012 123 12321 12312312
20131013 123 12321 12312312

B:

20131010 bbbb sad sadsad
20131011 asd asdd asdad
20231012 123 12321 12312312
20141013 123 12321 12312312
20141023 123 12321 12312312

ここで、これら 2 つの配列を最初の列 (日付) で内部結合する必要があります。結果は次のようになります。

20131010 123 12321 12312312 bbbb sad sadsad
20131011 123 12321 12312312 asd asdd asdad

どうやって作るの？それぞれに多数の列があるため、すべての列に名前を付けることはできませんが、比較列は実際には 1 つだけです。

score 6 · Accepted Answer

これは恐ろしく文書化されていませんが、チェックアウトしてくださいnumpy.lib.recfunctions.join_by。内部結合を含む、結合のようないくつかの種類の SQL を実行します。このモジュールは numpy ページには表示されませんが、少なくともドキュメント文字列から情報が得られます (以下の 1.9.1 からコピー)。

これが機能するには構造化配列が必要なように見えるので、単に「列 0 で結合」と言うのではなく、再配列にキャストする必要があるかもしれないことに注意してください。

Join arrays `r1` and `r2` on key `key`.

The key should be either a string or a sequence of string corresponding
to the fields used to join the array.  An exception is raised if the
`key` field cannot be found in the two input arrays.  Neither `r1` nor
`r2` should have any duplicates along `key`: the presence of duplicates
will make the output quite unreliable. Note that duplicates are not
looked for by the algorithm.

Parameters
----------
key : {string, sequence}
    A string or a sequence of strings corresponding to the fields used
    for comparison.
r1, r2 : arrays
    Structured arrays.
jointype : {'inner', 'outer', 'leftouter'}, optional
    If 'inner', returns the elements common to both r1 and r2.
    If 'outer', returns the common elements as well as the elements of
    r1 not in r2 and the elements of not in r2.
    If 'leftouter', returns the common elements and the elements of r1
    not in r2.
r1postfix : string, optional
    String appended to the names of the fields of r1 that are present
    in r2 but absent of the key.
r2postfix : string, optional
    String appended to the names of the fields of r2 that are present
    in r1 but absent of the key.
defaults : {dictionary}, optional
    Dictionary mapping field names to the corresponding default values.
usemask : {True, False}, optional
    Whether to return a MaskedArray (or MaskedRecords is
    `asrecarray==True`) or a ndarray.
asrecarray : {False, True}, optional
    Whether to return a recarray (or MaskedRecords if `usemask==True`)
    or just a flexible-type ndarray.

Notes
-----
* The output is sorted along the key.
* A temporary array is formed by dropping the fields not in the key for
  the two arrays and concatenating the result. This array is then
  sorted, and the common entries selected. The output is constructed by
  filling the fields with the selected entries. Matching is not
  preserved if there are some duplicates...

python - numpyで2つの配列を内部結合するにはどうすればよいですか?

A:

B:

1 に答える 1

Related

Reference