並列 for ループ内にタスク ステータス レポート機能を実装しようとしています。この for ループの並列化は「OPENMP」を使って行われています。
ステータスレポートを次のように実行したい:
Work done 70%; estimated time left 3:30:05 hour.
もちろん、「開始時刻」と「現在時刻」の差を計算することで、「推定残り時間」を計算できます。しかし、「静的」宣言を使用しても、forループ内で「作業完了」を正確に計算できないようです。
いくつかのガイダンスをいただければ幸いです。
私のコードの出力:
Values of cores : 8
Outer loop =================================
Thread 0 iCount0
% of work done 10
Outer loop =================================
Thread 0 iCount1
Outer loop =================================
Thread 2 iCount2
Outer loop =================================
Thread 7 iCount3
% of work done 40
Outer loop =================================
Thread 5 iCount4
% of work done 50
Outer loop =================================
Thread 3 iCount5
% of work done 60
Outer loop =================================
Thread 4 iCount6
% of work done 70
Outer loop =================================
Thread 1 iCount7
% of work done 20
% of work done 80
Outer loop =================================
Thread 6 iCount8
% of work done 90
Outer loop =================================
Thread 1 iCount9
% of work done 100
% of work done 30
出力の最後の 2 行からわかるように、ジョブのステータスを正しく計算できません。
これが私のコードです:
注: 出力バッファを何らかの形でフラッシュすると、work% の計算が台無しになるため、"\n" ではなく "std::endl" を意図的に使用しました。並列内で実際の計算を実行すると、同様のシナリオが発生すると確信しています
#include "stdafx.h"
#include <iostream> // std::cout, std::endl
#include <iomanip> // std::setfill, std::setw
#include <math.h> /* pow */
#include <omp.h>
int main(int argc, char** argv)
{
// Get the number of processors in this system
int iCPU = omp_get_num_procs();
// Now set the number of threads
omp_set_num_threads(iCPU);
std::cout << "Values of cores : " << iCPU <<" \n";
int x = 0;
int iTotalOuter = 10;
static int iCount = 0;
#pragma omp parallel for private(x)
for(int y = 0; y < iTotalOuter; y++)
{
std::cout << "Outer loop =================================\n" ;
std::cout <<"\nThread "<<omp_get_thread_num()<<" iCount" << iCount<<std::endl;
for(x = 0; x< 5; x++)
{
//std::cout << "Inner loop \n" ;
}
iCount = iCount + 1;
std::cout <<"\n % of work done " << (double)100*((double)iCount/(double)iTotalOuter)<<std::endl;
}
std::cin.ignore(); //Wait for user to hit enter
return 0;
}
更新: 「Avi Ginsburg」の回答に基づいて、次のようにしようとしています:
#include "stdafx.h"
#include <iostream> // std::cout, std::endl
#include <iomanip> // std::setfill, std::setw
#include <math.h> /* pow */
#include <omp.h>
void ReportJobStatus(int , int );
int main(int argc, char** argv)
{
// Get the number of processors in this system
int iCPU = omp_get_num_procs();
// Now set the number of threads
omp_set_num_threads(iCPU);
std::cout << "Values of cores : " << iCPU <<" \n";
int x = 0;
int iTotalOuter = 100;
static int iCount = 0;
#pragma omp parallel for private(x)
for(int y = 0; y < iTotalOuter; y++)
{
std::cout << "Outer loop =================================\n" ;
for(x = 0; x< 5; x++)
{
//std::cout << "Inner loop \n" ;
}
#pragma omp atomic
iCount++;
std::cout<< " omp_get_thread_num(): " << omp_get_thread_num() <<"\n";
if (omp_get_thread_num() == 0){
ReportJobStatus(iCount, iTotalOuter);
}
}
std::cin.ignore(); //Wait for user to hit enter
return 0;
}
問題 (更新):問題は、同時実行に同じスレッドが使用されていることです。そのため、「完了した作業」レポートは大幅に制限されます。データに基づいて、さまざまなコアにジョブを割り当てるにはどうすればよいでしょうか。
私のコードの現在の出力は次のとおりです。
Outer loop =================================
omp_get_thread_num(): 0
% of work done 1
Outer loop =================================
omp_get_thread_num(): 0
% of work done 2
Outer loop =================================
omp_get_thread_num(): 0
% of work done 3
Outer loop =================================
omp_get_thread_num(): 0
% of work done 4
Outer loop =================================
omp_get_thread_num(): 0
% of work done 5
Outer loop =================================
omp_get_thread_num(): 0
% of work done 6
Outer loop =================================
omp_get_thread_num(): 0
% of work done 7
Outer loop =================================
omp_get_thread_num(): 0
% of work done 8
Outer loop =================================
omp_get_thread_num(): 0
% of work done 9
Outer loop =================================
omp_get_thread_num(): 0
% of work done 10
Outer loop =================================
omp_get_thread_num(): 0
% of work done 11
Outer loop =================================
omp_get_thread_num(): 0
% of work done 12
Outer loop =================================
omp_get_thread_num(): 0
% of work done 13
Outer loop =================================
omp_get_thread_num(): 0
% of work done 14
Outer loop =================================
omp_get_thread_num(): 0
% of work done 15
Outer loop =================================
omp_get_thread_num(): 0
% of work done 16
Outer loop =================================
omp_get_thread_num(): 0
% of work done 17
Outer loop =================================
omp_get_thread_num(): 0
% of work done 18
Outer loop =================================
omp_get_thread_num(): 0
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 1
% of work done 19
Outer loop =================================
omp_get_thread_num(): 0
% of work done 54
Outer loop =================================
omp_get_thread_num(): 0
% of work done 55
Outer loop =================================
omp_get_thread_num(): 0
% of work done 56
Outer loop =================================
omp_get_thread_num(): 0
% of work done 57
Outer loop =================================
omp_get_thread_num(): 0
% of work done 58
Outer loop =================================
omp_get_thread_num(): 0
% of work done 59
Outer loop =================================
omp_get_thread_num(): 0
% of work done 60
Outer loop =================================
omp_get_thread_num(): 0
% of work done 61
Outer loop =================================
omp_get_thread_num(): 0
% of work done 62
Outer loop =================================
omp_get_thread_num(): 6
Outer loop =================================
omp_get_thread_num(): 6
Outer loop =================================
omp_get_thread_num(): 6
Outer loop =================================
omp_get_thread_num(): 6
Outer loop =================================
omp_get_thread_num(): 6
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 3
Outer loop =================================
omp_get_thread_num(): 1
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
omp_get_thread_num(): 5
Outer loop =================================
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
Outer loop =================================
omp_get_thread_num(): 4
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 7
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2
Outer loop =================================
omp_get_thread_num(): 2