experiment class for laghos #362

august-knox · 2024-09-06T17:07:16Z

bin/benchpark experiment init --dest laghos

I don't believe there are any variants that need to be specified here.

dyokelson

Nice job so far! Some updates requested; let me know if you have any questions.

dyokelson · 2024-09-17T16:59:41Z

repo/laghos/application.py

+            description='number of serial refinements',
+            workloads=['laghos'])
+
+    workload_variable('rp', default='0',


Let's keep this "example" default at what it was before, so the default value should be 5. We can now pass in different values via the scaling variants in the experiment.

dyokelson · 2024-09-17T17:00:07Z

repo/laghos/application.py

+    workload_variable('rp', default='0',
+            description='number of parallel refinements',
+            workloads=['laghos'])
+    workload_variable('ms', default='4',


Likewise, let's change this default value back to 500.

dyokelson · 2024-09-17T17:21:21Z

var/exp_repo/experiments/laghos/experiment.py

+    def compute_applications_section(self):
+        variables = {}
+
+        if self.spec.satisfies("scaling=weak"):


Let's start with the scaling on a single node and change the problem size per rank instead. Once we know we have this right, we can add more nodes or make other updates. So for weak scaling:
rs values: 1
num processes: 8
rp values (increases workload per rank): 1,2,3,4

Another option, if you can also implement it, is rs values 1,2,3 with MPI processes 1,8,64.
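As a rough sketch of the two weak-scaling options above, written as plain Python dictionaries rather than the actual experiment class: `weak_option_a`, `weak_option_b`, and `paired_runs` are hypothetical names, and the element-wise pairing of equal-length lists is an assumption about how the list-valued variables get expanded.

```python
# Option A: one node, 8 ranks, fixed rs, and rp growing the work per rank.
weak_option_a = {
    "n_nodes": "1",
    "n_ranks": "8",
    "rs": "1",
    "rp": ["1", "2", "3", "4"],
}

# Option B: one node, with rs and the MPI process count growing together.
weak_option_b = {
    "n_nodes": "1",
    "rs": ["1", "2", "3"],
    "n_ranks": ["1", "8", "64"],
}


def paired_runs(variables, keys):
    """Pair list-valued variables element by element (hypothetical helper).

    Assumes the named variables are lists of equal length.
    """
    return list(zip(*(variables[key] for key in keys)))


# Each tuple is one (rs, n_ranks) run under option B.
runs_b = paired_runs(weak_option_b, ["rs", "n_ranks"])
```

Under this pairing, option B gives three runs, one per (rs, n_ranks) pair, while option A gives four runs that differ only in rp.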

dyokelson · 2024-09-17T17:23:52Z

var/exp_repo/experiments/laghos/experiment.py

+            variables["rs"] = ["5","6"]
+            variables["n_nodes"] = ["1","8"]
+        elif self.spec.satisfies("scaling=strong"):
+            variables["n_nodes"] = ["1","8"]


Instead of increasing nodes for strong scaling, let's keep it on one node and increase the MPI ranks: 1, 8, 64. We should also lower the rs value to 1 or 2 here, depending on how long this takes to solve; ideally each run would take less than 5 minutes.

dyokelson · 2024-09-17T17:29:53Z

var/exp_repo/experiments/laghos/experiment.py

+            variables["rs"] = "5"
+        else:
+            variables["n_nodes"] = ["1","2","4","8","16","32","64","128"]    
+        variables["n_ranks"] = "{sys_cores_per_node} * {n_nodes}"


Since we're setting ranks explicitly above, let's move both n_nodes and n_ranks into the else branch and set them to one node and 8 ranks, since this will be for the example run.
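A minimal sketch of how the branches might look with the requested changes applied. This is a standalone stand-in, not the real method: `scaling_variables` is a hypothetical function, and its `scaling` string argument replaces the `self.spec.satisfies("scaling=...")` checks so the snippet runs on its own. The rs value of 1 in the strong branch is one of the two values suggested in review.

```python
def scaling_variables(scaling="example"):
    """Hypothetical stand-in for the branch logic in compute_applications_section."""
    variables = {}

    if scaling == "weak":
        # Single node, fixed rank count; rp increases the work per rank.
        variables["n_nodes"] = "1"
        variables["n_ranks"] = "8"
        variables["rs"] = "1"
        variables["rp"] = ["1", "2", "3", "4"]
    elif scaling == "strong":
        # Single node, fixed problem size; only the rank count changes.
        variables["n_nodes"] = "1"
        variables["n_ranks"] = ["1", "8", "64"]
        variables["rs"] = "1"  # review suggests 1 or 2, depending on run time
    else:
        # Example run: n_nodes and n_ranks are set only here.
        variables["n_nodes"] = "1"
        variables["n_ranks"] = "8"

    return variables
```

With this shape, nothing outside the branches overrides n_ranks, so the explicit rank lists in the scaling branches are what actually get used.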

dyokelson · 2024-09-17T17:31:46Z

var/exp_repo/experiments/laghos/experiment.py

+                    "problem": {
+                       #"variables": variables,
+                        "experiments": {
+                            "laghos_{n_nodes}_{n_ranks}": {


Can you add the other parameters to the naming scheme here? Basically, anything that gets passed in (rs, rp, etc.) should be in the name. See the naming for the kripke experiment in https://github.com/LLNL/benchpark/pull/344/files
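One possible way to build such a name, sketched as a standalone helper. `experiment_name` is hypothetical, and the exact format used by the kripke experiment is not reproduced here; the sketch only extends the `laghos_{n_nodes}_{n_ranks}` pattern from the diff with one placeholder per variable.

```python
def experiment_name(app, variables):
    """Build a name template with one {placeholder} per variable (hypothetical helper)."""
    # Dictionaries keep insertion order, so placeholders follow the order
    # in which the variables were added.
    placeholders = ["{" + name + "}" for name in variables]
    return "_".join([app] + placeholders)


name = experiment_name(
    "laghos",
    {"n_nodes": "1", "n_ranks": "8", "rs": "1", "rp": ["1", "2", "3", "4"]},
)
```

Deriving the name from the same dictionary that holds the variables means a newly added parameter shows up in the name without a second edit.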

@august-knox I enabled a dry-run test and it is failing. Please diagnose/fix.

initial commit for laghos experiment.py file

a541f5b

github-actions bot added the experiment New or modified experiment label Sep 6, 2024

adding strong and weak scaling variants

5cf7b0c

github-actions bot added the application New or modified application label Sep 12, 2024

august-knox added 2 commits September 12, 2024 15:34

changing scaling variant to experiments and adding existing exp

a6ce707

changed default exp to example

caf4694

dyokelson requested changes Sep 17, 2024


august-knox and others added 7 commits September 19, 2024 15:00

updating scaling

aa247b0

fixing error w/ application.py

304058f

Merge branch 'develop' into experiment/laghos

d5a088b

lint

ce930a5

lint

b7f60f5

Merge branch 'develop' into experiment/laghos

1545b13

Adding dry run

45cbc7f

github-actions bot added the ci Involving Project CI & Unit Tests label Sep 24, 2024

pearce8 added 2 commits September 24, 2024 13:55

Adding dry run

336de63

Removing modifier

525685c

pearce8 added the ci (Involving Project CI & Unit Tests) and changes requested labels, and removed the ci (Involving Project CI & Unit Tests) and application (New or modified application) labels Sep 27, 2024
