GPU Recommendation for Parallel Computing

Views: 9 (last 30 days)
Josh Coval on 14 Feb 2019
Commented: Walter Roberson on 14 Feb 2019
Hi. I am trying to speed up a boosted tree learning algorithm via parfor. I have been able to get it running on AWS, but this hasn't proven to be an ideal solution for development work, as AWS charges a lot for keeping the cluster in an online state and takes a fair amount of time to change the state from offline to online. And so, I am interested in exploring the possibility of doing some of the development work using a local GPU cluster instead of AWS. Can you recommend a decent GPU (@ ~$1000) for a problem that requires 100-500 iterations, each of which takes around 3 minutes to run in serial on a decent laptop, and relies on around 200MB of data to be passed to and processed by each worker? Or is this not a sensible route to pursue given my problem and budget? I just don't have a good sense of the extent to which such a problem could be parallelized using a single GPU (and whether the memory or the processing capacity of the individual GPU workers will be the binding constraint).
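For reference, a local parfor setup along these lines might look like the sketch below. It uses parallel.pool.Constant so the ~200MB dataset is copied to each worker once rather than on every iteration; the file name and the growForestp call signature are assumptions, since the actual code isn't shown in the thread.

```matlab
% Minimal sketch of running the loop on a local pool instead of AWS.
% growForestp's real signature isn't shown in this thread; the call
% below (data struct plus iteration index) is a placeholder.
pool = parpool('local');            % one worker per physical core by default

data = load('trainingData.mat');    % hypothetical file holding the ~200 MB dataset
C = parallel.pool.Constant(data);   % copied to each worker once, not per iteration

nIter = 500;
results = cell(nIter, 1);
parfor i = 1:nIter
    results{i} = growForestp(C.Value, i);   % each iteration runs on a worker
end
delete(pool);
```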
  4 Comments
Matt J on 14 Feb 2019
It's a good start, but we need to see the slow part of the code, presumably growForestp, if we're to recommend ways to optimize it.
Josh Coval on 14 Feb 2019
I'm afraid I may get into trouble if I post much of growForestp (and it also has a large number of lines). That having been said, I'm not really looking to optimize the growForestp code so much as identify a good hardware setup that will allow it to be run in parallel locally instead of on AWS. But I totally understand that this may not give you enough information for you to provide any additional guidance -- and I do appreciate your pointing out that a single local GPU will be a poor substitute.


Accepted Answer

Matt J on 14 Feb 2019
Edited: Matt J on 14 Feb 2019
Well, the one general thing I can say is that if you convert all of the data1...data5 variables to gpuArray objects, then the manipulations done by growForestp would likely be considerably faster, assuming they consist of a lot of matrix arithmetic. In other words, you can use the GPU to gain speed in ways other than just deploying parallel instances of growForestp.
I don't know what kind of GPU resources AWS offers. Maybe each cluster node has its own GPU? If you want to implement on your own local cluster sharing a single GPU, I would probably go with the GeForce GTX Titan X (which has 12 GB RAM) or the GeForce GTX 1080 Ti (which has 11 GB RAM). That should easily accommodate jobs from at least 20 parallel workers. Of course, I am not sure what the communication overhead would be from 20 workers trying to share/access a single GPU card...
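The gpuArray conversion mentioned above might look like the following sketch. The data1...data5 names come from the thread, but the arithmetic shown is purely illustrative, not the actual growForestp code.

```matlab
% Sketch of moving matrix arithmetic onto the GPU.
% (Illustrative only; the real growForestp computations aren't shown.)
data1 = gpuArray(data1);   % transfer to GPU memory
data2 = gpuArray(data2);
% ... likewise for data3...data5

% Subsequent matrix/element-wise operations now run on the GPU,
% because their operands are gpuArray objects:
scores = data1 * data2';

result = gather(scores);   % copy the result back to host memory
```

Note that the speedup depends on the work being dominated by large matrix operations; transfers to and from the GPU (the gpuArray and gather calls) carry their own overhead.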
  2 Comments
Josh Coval on 14 Feb 2019
super helpful. thanks!
Walter Roberson on 14 Feb 2019
MathWorks recommends against sharing a GPU between parallel workers. The communication overhead of synchronization is one of the most expensive GPU operations.


More Answers (0)


Release

R2018a

