-
Notifications
You must be signed in to change notification settings - Fork 230
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Start Guide for RAPIDS on AWS EMR 6.2 #1371
Conversation
start guide for running Rapids on AWS EMR Spark 3.0
fix space in "spark.task.cpus ":"1",
Mortgage ETL Spark 3.0 Notebook for AWS EMR
add ami id for us-west-2 region
Updated file with EMR 6.2 official release
update pictures with EMR 6.2
minor edits
add notebook results
disable AQE for bug
Minor edits Signed-off-by: Kong Zhao <mgzhao@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In addition to the specific comments I have some general ones:
- It would be nice to have all the files specific to AWS EMR be placed in a directory that is specific to EMR (e.g.:
docs/get-started/aws-emr
). Otherwise thedocs/get-started
directory is going to get even more cluttered as more platforms are added. - The
docs/FAQ.md
should be updated to add a link to the newgetting-started-aws-emr.md
document.
|
||
In the "Edit software settings" field, copy and paste the configuration from the [EMR document](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-rapids.html). You can also create a JSON file on you own S3 bucket. | ||
|
||
![Step 1: Step 1: Software, Configuration and Steps](pics/Rapids_EMR_GUI_1.PNG) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Elsewhere in the docs/
directory we're using img
as the directory name to hold images. It would be nice to keep this consistent. Also RAPIDS should be all caps in the filenames. Interesting it's the only part of the image filenames that is not capitalized. 😄
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@mgzhao Tom pointed out that the config.json didn't have the right settings. I suggest that we provide a sample config.json here. For example; in this cluster you chose g4dn.2xlarge, which has 8 vCPU cores. The right config would be use all the cores, we need to spark.executor.cores=8, spark.task.resource.gpu.amount=0.125.
Here is the sample config for this cluster:
{
"Classification":"spark-defaults",
"Properties":{
"spark.plugins":"com.nvidia.spark.SQLPlugin",
"spark.sql.sources.useV1SourceList":"",
"spark.executor.extraJavaOptions":"-Dai.rapids.cudf.prefer-pinned=true",
"spark.executor.resource.gpu.discoveryScript":"/usr/lib/spark/scripts/gpu/getGpusResources.sh",
"spark.submit.pyFiles":"/usr/lib/spark/jars/xgboost4j-spark_3.0-1.0.0-0.2.0.jar",
"spark.executor.extraLibraryPath":"/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/compat/lib:/usr/local/cuda/lib:/usr/local/cuda/lib64:/usr/lib/hadoop/lib/native:/usr/lib/hadoop-lzo/lib/native:/docker/usr/lib/hadoop/lib/native:/docker/usr/lib/hadoop-lzo/lib/native",
"spark.rapids.sql.concurrentGpuTasks":"4",
"spark.executor.resource.gpu.amount":"1",
"spark.executor.cores":"8",
"spark.task.cpus ":"1",
"spark.task.resource.gpu.amount":"0.125",
"spark.rapids.memory.pinnedPool.size":"2G",
"spark.executor.memoryOverhead":"2G",
"spark.locality.wait":"0s",
"spark.sql.shuffle.partitions":"200",
"spark.sql.files.maxPartitionBytes":"256m",
"spark.sql.adaptive.enabled":"false"
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I updated the FAQ.md and created subdirectories in demos and img for different CSPs. I also updated the file names. I believe Kong had already updated the config settings in the json.
|
||
In the "Edit software settings" field, copy and paste the configuration from the [EMR document](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-rapids.html). You can also create a JSON file on you own S3 bucket. | ||
|
||
![Step 1: Step 1: Software, Configuration and Steps](pics/Rapids_EMR_GUI_1.PNG) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some of hte configs in EMR guide are not right or should be adjusted for cluster setup.
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
update default SPARK config, change to img folder, correct spellings for RAPIDS, fix URL.
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor extra suggestion, but I'm going to go ahead and approve this now since I'll be on vacation for a bit. I don't want to totally block this for the 0.3 release. However it would be good to get a review from @sameerz or @krajendrannv before committing this.
… demo folders by CSP Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
I updated the content, and modified a few related files (FAQ.md, the databricks and gcp dataproc getting started guides) as the img locations were also cleaned up. Please review. |
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Looks good but there are some conflicts that need to be resolved. |
Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
I think I resolved all the comments. I am not sure how to resolve the conflicting files - I followed the command line instructions, but no changes appear to have been applied or resolved. |
Did you have the latest branch-0.3 changes in your local repository when you performed the merge from branch-0.3? That would explain the lack of changes. |
Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
build |
* Add files via upload start guide for running Rapids on AWS EMR Spark 3.0 * Add files via upload JSON * Add files via upload fix space in "spark.task.cpus ":"1", * Add files via upload Mortgage ETL Spark 3.0 Notebook for AWS EMR * Add files via upload * Add files via upload * Add files via upload * Add files via upload * Add files via upload add ami id for us-west-2 region * Add files via upload Updated file with EMR 6.2 official release * Add files via upload update pictures with EMR 6.2 * Add files via upload minor edits * Add files via upload add notebook results * Add files via upload disable AQE for bug * Add files via upload * Add files via upload Minor edits Signed-off-by: Kong Zhao <mgzhao@gmail.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Modified based on PR feedback update default SPARK config, change to img folder, correct spellings for RAPIDS, fix URL. * Delete Rapids_EMR_GUI_5.PNG move to img folder * Delete Rapids_EMR_GUI_4.PNG move to img folder * Delete Rapids_EMR_GUI_3.PNG move to img folder * Delete Rapids_EMR_GUI_2.PNG * Delete Rapids_EMR_GUI_1.PNG move to img folder * Delete EMR_notebook_3.png move to img folder * Delete EMR_notebook_2.png move to img folder * Delete EMR_notebook_1.png move to img folder * Update docs for getting started and FAQ for AWS-EMR, organize img and demo folders by CSP Signed-off-by: Sameer Raheja <sraheja@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Updated based on comments, updated to latest sparkconfig.png Signed-off-by: Sameer Raheja <sraheja@nvidia.com> Co-authored-by: Jason Lowe <jlowe@nvidia.com> Co-authored-by: Sameer Raheja <sraheja@nvidia.com> Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
* Add files via upload start guide for running Rapids on AWS EMR Spark 3.0 * Add files via upload JSON * Add files via upload fix space in "spark.task.cpus ":"1", * Add files via upload Mortgage ETL Spark 3.0 Notebook for AWS EMR * Add files via upload * Add files via upload * Add files via upload * Add files via upload * Add files via upload add ami id for us-west-2 region * Add files via upload Updated file with EMR 6.2 official release * Add files via upload update pictures with EMR 6.2 * Add files via upload minor edits * Add files via upload add notebook results * Add files via upload disable AQE for bug * Add files via upload * Add files via upload Minor edits Signed-off-by: Kong Zhao <mgzhao@gmail.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Modified based on PR feedback update default SPARK config, change to img folder, correct spellings for RAPIDS, fix URL. * Delete Rapids_EMR_GUI_5.PNG move to img folder * Delete Rapids_EMR_GUI_4.PNG move to img folder * Delete Rapids_EMR_GUI_3.PNG move to img folder * Delete Rapids_EMR_GUI_2.PNG * Delete Rapids_EMR_GUI_1.PNG move to img folder * Delete EMR_notebook_3.png move to img folder * Delete EMR_notebook_2.png move to img folder * Delete EMR_notebook_1.png move to img folder * Update docs for getting started and FAQ for AWS-EMR, organize img and demo folders by CSP Signed-off-by: Sameer Raheja <sraheja@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Update docs/get-started/getting-started-aws-emr.md Co-authored-by: Jason Lowe <jlowe@nvidia.com> * Updated based on comments, updated to latest sparkconfig.png Signed-off-by: Sameer Raheja <sraheja@nvidia.com> Co-authored-by: Jason Lowe <jlowe@nvidia.com> Co-authored-by: Sameer Raheja <sraheja@nvidia.com> Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
…IDIA#1371) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
No description provided.