Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Start Guide for RAPIDS on AWS EMR 6.2 #1371

Merged
merged 34 commits into from
Jan 6, 2021
Merged

Conversation

mgzhao
Copy link
Contributor

@mgzhao mgzhao commented Dec 11, 2020

No description provided.

start guide for running Rapids on AWS EMR Spark 3.0
fix  space in "spark.task.cpus ":"1",
Mortgage ETL Spark 3.0 Notebook for AWS EMR
add ami id for  us-west-2 region
Updated file with EMR 6.2 official release
update pictures with EMR 6.2
add notebook results
disable AQE for bug
Minor edits

Signed-off-by: Kong Zhao <mgzhao@gmail.com>
@jlowe jlowe changed the title Start Guide for Rapids on AWS EMR 6.2 Start Guide for RAPIDS on AWS EMR 6.2 Dec 14, 2020
@jlowe jlowe added the documentation Improvements or additions to documentation label Dec 14, 2020
Copy link
Member

@jlowe jlowe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In addition to the specific comments I have some general ones:

  • It would be nice to have all the files specific to AWS EMR be placed in a directory that is specific to EMR (e.g.: docs/get-started/aws-emr). Otherwise the docs/get-started directory is going to get even more cluttered as more platforms are added.
  • The docs/FAQ.md should be updated to add a link to the new getting-started-aws-emr.md document.

docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved

In the "Edit software settings" field, copy and paste the configuration from the [EMR document](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-rapids.html). You can also create a JSON file on you own S3 bucket.

![Step 1: Step 1: Software, Configuration and Steps](pics/Rapids_EMR_GUI_1.PNG)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Elsewhere in the docs/ directory we're using img as the directory name to hold images. It would be nice to keep this consistent. Also RAPIDS should be all caps in the filenames. Interesting it's the only part of the image filenames that is not capitalized. 😄

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mgzhao Tom pointed out that the config.json didn't have the right settings. I suggest that we provide a sample config.json here. For example; in this cluster you chose g4dn.2xlarge, which has 8 vCPU cores. The right config would be use all the cores, we need to spark.executor.cores=8, spark.task.resource.gpu.amount=0.125.
Here is the sample config for this cluster:
{
"Classification":"spark-defaults",
"Properties":{
"spark.plugins":"com.nvidia.spark.SQLPlugin",
"spark.sql.sources.useV1SourceList":"",
"spark.executor.extraJavaOptions":"-Dai.rapids.cudf.prefer-pinned=true",
"spark.executor.resource.gpu.discoveryScript":"/usr/lib/spark/scripts/gpu/getGpusResources.sh",
"spark.submit.pyFiles":"/usr/lib/spark/jars/xgboost4j-spark_3.0-1.0.0-0.2.0.jar",
"spark.executor.extraLibraryPath":"/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/compat/lib:/usr/local/cuda/lib:/usr/local/cuda/lib64:/usr/lib/hadoop/lib/native:/usr/lib/hadoop-lzo/lib/native:/docker/usr/lib/hadoop/lib/native:/docker/usr/lib/hadoop-lzo/lib/native",
"spark.rapids.sql.concurrentGpuTasks":"4",
"spark.executor.resource.gpu.amount":"1",
"spark.executor.cores":"8",
"spark.task.cpus ":"1",
"spark.task.resource.gpu.amount":"0.125",
"spark.rapids.memory.pinnedPool.size":"2G",
"spark.executor.memoryOverhead":"2G",
"spark.locality.wait":"0s",
"spark.sql.shuffle.partitions":"200",
"spark.sql.files.maxPartitionBytes":"256m",
"spark.sql.adaptive.enabled":"false"
}

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I updated the FAQ.md and created subdirectories in demos and img for different CSPs. I also updated the file names. I believe Kong had already updated the config settings in the json.


In the "Edit software settings" field, copy and paste the configuration from the [EMR document](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-rapids.html). You can also create a JSON file on you own S3 bucket.

![Step 1: Step 1: Software, Configuration and Steps](pics/Rapids_EMR_GUI_1.PNG)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

some of hte configs in EMR guide are not right or should be adjusted for cluster setup.

mgzhao and others added 9 commits December 15, 2020 10:12
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
update default SPARK config, change to img folder,  correct spellings for RAPIDS, fix URL.
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
move to img folder
jlowe
jlowe previously approved these changes Dec 18, 2020
Copy link
Member

@jlowe jlowe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor extra suggestion, but I'm going to go ahead and approve this now since I'll be on vacation for a bit. I don't want to totally block this for the 0.3 release. However it would be good to get a review from @sameerz or @krajendrannv before committing this.

docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
… demo folders by CSP

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
@sameerz
Copy link
Collaborator

sameerz commented Jan 3, 2021

I updated the content, and modified a few related files (FAQ.md, the databricks and gcp dataproc getting started guides) as the img locations were also cleaned up. Please review.

docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
sameerz and others added 2 commits January 4, 2021 07:05
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
@jlowe
Copy link
Member

jlowe commented Jan 4, 2021

Looks good but there are some conflicts that need to be resolved.

docs/get-started/getting-started-aws-emr.md Outdated Show resolved Hide resolved
docs/get-started/getting-started-aws-emr.md Show resolved Hide resolved
docs/get-started/preview_config.json Outdated Show resolved Hide resolved
@sameerz
Copy link
Collaborator

sameerz commented Jan 5, 2021

I think I resolved all the comments. I am not sure how to resolve the conflicting files - I followed the command line instructions, but no changes appear to have been applied or resolved.

@jlowe
Copy link
Member

jlowe commented Jan 5, 2021

I followed the command line instructions, but no changes appear to have been applied or resolved.

Did you have the latest branch-0.3 changes in your local repository when you performed the merge from branch-0.3? That would explain the lack of changes.

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>
@sameerz
Copy link
Collaborator

sameerz commented Jan 5, 2021

build

@sameerz sameerz merged commit 3d3fe74 into NVIDIA:branch-0.3 Jan 6, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Add files via upload

start guide for running Rapids on AWS EMR Spark 3.0

* Add files via upload

JSON

* Add files via upload

fix  space in "spark.task.cpus ":"1",

* Add files via upload

Mortgage ETL Spark 3.0 Notebook for AWS EMR

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

add ami id for  us-west-2 region

* Add files via upload

Updated file with EMR 6.2 official release

* Add files via upload

update pictures with EMR 6.2

* Add files via upload

minor edits

* Add files via upload

add notebook results

* Add files via upload

disable AQE for bug

* Add files via upload

* Add files via upload

Minor edits

Signed-off-by: Kong Zhao <mgzhao@gmail.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Modified based on PR feedback

update default SPARK config, change to img folder,  correct spellings for RAPIDS, fix URL.

* Delete Rapids_EMR_GUI_5.PNG

move to img folder

* Delete Rapids_EMR_GUI_4.PNG

move to img folder

* Delete Rapids_EMR_GUI_3.PNG

move to img folder

* Delete Rapids_EMR_GUI_2.PNG

* Delete Rapids_EMR_GUI_1.PNG

move to img folder

* Delete EMR_notebook_3.png

move to img folder

* Delete EMR_notebook_2.png

move to img folder

* Delete EMR_notebook_1.png

move to img folder

* Update docs for getting started and FAQ for AWS-EMR, organize img and demo folders by CSP

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Updated based on comments, updated to latest sparkconfig.png

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Sameer Raheja <sraheja@nvidia.com>
Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Add files via upload

start guide for running Rapids on AWS EMR Spark 3.0

* Add files via upload

JSON

* Add files via upload

fix  space in "spark.task.cpus ":"1",

* Add files via upload

Mortgage ETL Spark 3.0 Notebook for AWS EMR

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

add ami id for  us-west-2 region

* Add files via upload

Updated file with EMR 6.2 official release

* Add files via upload

update pictures with EMR 6.2

* Add files via upload

minor edits

* Add files via upload

add notebook results

* Add files via upload

disable AQE for bug

* Add files via upload

* Add files via upload

Minor edits

Signed-off-by: Kong Zhao <mgzhao@gmail.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Modified based on PR feedback

update default SPARK config, change to img folder,  correct spellings for RAPIDS, fix URL.

* Delete Rapids_EMR_GUI_5.PNG

move to img folder

* Delete Rapids_EMR_GUI_4.PNG

move to img folder

* Delete Rapids_EMR_GUI_3.PNG

move to img folder

* Delete Rapids_EMR_GUI_2.PNG

* Delete Rapids_EMR_GUI_1.PNG

move to img folder

* Delete EMR_notebook_3.png

move to img folder

* Delete EMR_notebook_2.png

move to img folder

* Delete EMR_notebook_1.png

move to img folder

* Update docs for getting started and FAQ for AWS-EMR, organize img and demo folders by CSP

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Update docs/get-started/getting-started-aws-emr.md

Co-authored-by: Jason Lowe <jlowe@nvidia.com>

* Updated based on comments, updated to latest sparkconfig.png

Signed-off-by: Sameer Raheja <sraheja@nvidia.com>

Co-authored-by: Jason Lowe <jlowe@nvidia.com>
Co-authored-by: Sameer Raheja <sraheja@nvidia.com>
Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this pull request Nov 30, 2023
…IDIA#1371)

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants