[Example] Add preformer for precipitation nowcasting #976

EricKing19 · 2024-08-19T01:59:49Z

PR types

Others

PR changes

Others

Describe

add Preformer model for precipitation nowcasting
add docs for Preformer
add examples for Preformer

paddle-bot · 2024-08-19T01:59:54Z

Thanks for your contribution!

CLAassistant · 2024-08-19T01:59:55Z

All committers have signed the CLA.

HydrogenSulfate

感谢提交PR，有几处小问题麻烦看一下

HydrogenSulfate · 2024-08-20T06:38:17Z

docs/zh/examples/preformer.md

+
+    ``` sh
+    # 模型训练
+    python examples/preformer/train.py


Suggested change

python examples/preformer/train.py

python train.py

HydrogenSulfate · 2024-08-20T06:38:25Z

docs/zh/examples/preformer.md

+
+    ``` sh
+    # 模型评估
+    python examples/preformer/train.py mode=eval


Suggested change

python examples/preformer/train.py mode=eval

python train.py mode=eval

HydrogenSulfate · 2024-08-20T06:38:45Z

examples/preformer/train.py

文件建议改名为main.py

HydrogenSulfate · 2024-08-20T06:39:19Z

examples/preformer/train.py

+    # set random seed for reproducibility
+    ppsci.utils.misc.set_random_seed(cfg.seed)
+    # initialize logger
+    logger.init_logger("ppsci", osp.join(cfg.output_dir, "train.log"), "info")
+


HydrogenSulfate · 2024-08-20T06:42:37Z

examples/preformer/train.py

+                "num_replicas": NUM_GPUS_PER_NODE,
+                "rank": dist.get_rank() % NUM_GPUS_PER_NODE,


这两个参数应该不需要，并且paddlescience也没有对应的处理逻辑，默认会根据环境中设置的卡数自动设置

HydrogenSulfate · 2024-08-20T06:54:11Z

ppsci/data/dataset/era5sq_dataset.py

+            mon = str("0") + mon
+        day = str(self.time_table[idxs].timetuple().tm_mday)
+        if len(day) == 1:
+            day = str("0") + day


str("0")是否可以直接写成"0"？，下同

HydrogenSulfate · 2024-08-20T06:54:48Z

ppsci/data/dataset/era5sq_dataset.py

+        r_data = np.load(
+            os.path.join(self.file_path, year, "r_" + year + mon + day + hour + ".npy")
+        )
+        t_data = np.load(
+            os.path.join(self.file_path, year, "t_" + year + mon + day + hour + ".npy")
+        )
+        u_data = np.load(
+            os.path.join(self.file_path, year, "u_" + year + mon + day + hour + ".npy")
+        )
+        v_data = np.load(
+            os.path.join(self.file_path, year, "v_" + year + mon + day + hour + ".npy")
+        )


可以直接使用f-string化简字符串拼接的写法

HydrogenSulfate · 2024-08-20T06:55:45Z

examples/preformer/conf/train.yaml

+hydra:
+  run:
+    # dynamic output directory according to running time and override name
+    dir: outputs_preformer
+  job:
+    name: ${mode} # name of logfile
+    chdir: false # keep current working directory unchanged
+    config:
+      override_dirname:
+        exclude_keys:
+          - TRAIN.checkpoint_path
+          - TRAIN.trained_model_path
+          - EVAL.trained_model_path
+          - mode
+          - output_dir
+          - log_freq
+  sweep:
+    # output directory for multirun
+    dir: ${hydra.run.dir}
+    subdir: ./
+


Suggested change

hydra:

run:

# dynamic output directory according to running time and override name

dir: outputs_preformer

job:

name: ${mode} # name of logfile

chdir: false # keep current working directory unchanged

config:

override_dirname:

exclude_keys:

- TRAIN.checkpoint_path

- TRAIN.trained_model_path

- EVAL.trained_model_path

- mode

- output_dir

- log_freq

sweep:

# output directory for multirun

dir: ${hydra.run.dir}

subdir: ./

defaults:

- ppsci_default

- TRAIN: train_default

- TRAIN/ema: ema_default

- TRAIN/swa: swa_default

- EVAL: eval_default

- INFER: infer_default

- hydra/job/config/override_dirname/exclude_keys: exclude_keys_default

- _self_

hydra:

run:

# dynamic output directory according to running time and override name

dir: outputs_preformer

job:

name: ${mode} # name of logfile

chdir: false # keep current working directory unchanged

sweep:

# output directory for multirun

dir: ${hydra.run.dir}

subdir: ./

HydrogenSulfate · 2024-08-20T06:56:07Z

examples/preformer/conf/train.yaml

+
+# model settings
+MODEL:
+  afno:


单模型可以删除afno这一层级

HydrogenSulfate · 2024-08-20T07:04:21Z

examples/preformer/conf/train.yaml

+  afno:
+    input_keys: ["input"]
+    output_keys: ["output"]
+    shape_in: [6, 12, IMG_H, IMG_W]


Suggested change

shape_in: [6, 12, IMG_H, IMG_W]

shape_in:

- 6

- 12

- ${IMG_H}

- ${IMG_W}

HydrogenSulfate · 2024-08-20T11:54:07Z

@EricKing19 标题已经修改过了，原先的merge code of upstream不太合适

liaoxin2 · 2024-08-27T02:55:54Z

docs/zh/examples/preformer.md

+案例中使用了预处理的 PEMSD4 和 PEMSD8 数据集。PEMSD4 为旧金山湾区交通数据，选取 29 条道路上 307 个传感器记录的交通数据，时间为 2018 年 1 月至 2 月。PEMSD8 为圣贝纳迪诺 8 条道路上 170 个检测器收集的交通数据，时间为 2016 年 7 月至 8 月。
+
+两个数据集均被保存为 N x T x 1 的矩阵，记录了相应交通节点与时间的流量数据，其中 N 为交通节点数量，T 为时间序列长度。两个数据集分别按照 7:2:1 划分为训练集、验证集，和测试集。案例中预先计算了流量数据的均值与标准差，用于后续的正则化操作。


该案例是关于降水的，这个数据集好像是交通的，数据集与代码不一致

merge code of upstream

b7e0216

paddle-bot bot added the contributor label Aug 19, 2024

HydrogenSulfate requested changes Aug 20, 2024

View reviewed changes

HydrogenSulfate changed the title ~~merge code of upstream~~ [Example] Add preformer for precipitation nowcasting Aug 20, 2024

luotao1 self-assigned this Aug 21, 2024

liaoxin2 reviewed Aug 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Example] Add preformer for precipitation nowcasting #976

[Example] Add preformer for precipitation nowcasting #976

EricKing19 commented Aug 19, 2024

paddle-bot bot commented Aug 19, 2024

CLAassistant commented Aug 19, 2024 •

edited

Loading

HydrogenSulfate left a comment

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate Aug 20, 2024

HydrogenSulfate commented Aug 20, 2024

liaoxin2 Aug 27, 2024

	python examples/preformer/train.py mode=eval
	python train.py mode=eval

		"num_replicas": NUM_GPUS_PER_NODE,
		"rank": dist.get_rank() % NUM_GPUS_PER_NODE,

		案例中使用了预处理的 PEMSD4 和 PEMSD8 数据集。PEMSD4 为旧金山湾区交通数据，选取 29 条道路上 307 个传感器记录的交通数据，时间为 2018 年 1 月至 2 月。PEMSD8 为圣贝纳迪诺 8 条道路上 170 个检测器收集的交通数据，时间为 2016 年 7 月至 8 月。

		两个数据集均被保存为 N x T x 1 的矩阵，记录了相应交通节点与时间的流量数据，其中 N 为交通节点数量，T 为时间序列长度。两个数据集分别按照 7:2:1 划分为训练集、验证集，和测试集。案例中预先计算了流量数据的均值与标准差，用于后续的正则化操作。

[Example] Add preformer for precipitation nowcasting #976

Are you sure you want to change the base?

[Example] Add preformer for precipitation nowcasting #976

Conversation

EricKing19 commented Aug 19, 2024

PR types

PR changes

Describe

paddle-bot bot commented Aug 19, 2024

CLAassistant commented Aug 19, 2024 • edited Loading

HydrogenSulfate left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

HydrogenSulfate commented Aug 20, 2024

Choose a reason for hiding this comment

CLAassistant commented Aug 19, 2024 •

edited

Loading