使用 PyTorch 进行时间序列预测 - 数据加载器、归一化、指标和模型

这些详情尚未由 PyPI 验证

项目描述

PyTorch Forecasting

PyTorch 预测 是一个基于 PyTorch 的包，用于使用最先进的深度学习架构进行预测。它提供了一个高级 API，并使用 PyTorch Lightning 在 GPU 或 CPU 上扩展训练，具有自动记录功能。

	文档 · 教程 · 发布说明
开源
社区
CI/CD
代码

我们关于 Towards Data Science 的文章介绍了这个包并提供了一些背景信息。

PyTorch 预测旨在通过神经网络简化最先进的时间序列预测，无论是针对现实案例还是研究。目标是提供一个具有最大灵活性的高级 API，并为初学者提供合理的默认值。具体来说，该包提供

一个时间序列数据集类，该类抽象处理变量转换、缺失值、随机子采样、多个历史长度等。
一个基础模型类，提供时间序列模型的基本训练，以及Tensorboard日志记录和通用可视化，如实际值与预测值比较图和依赖图。
针对时间序列预测进行了增强的多层神经网络架构，适用于实际部署，并带有内置的解释能力。
多步预测的时间序列指标。
使用optuna进行超参数调整。

该软件包基于pytorch-lightning构建，允许直接在CPU和单个或多个GPU上训练。

安装

如果您正在使用Windows，您需要首先使用以下命令安装PyTorch：

pip install torch -f https://download.pytorch.org/whl/torch_stable.html.

否则，您可以继续使用以下命令：

pip 安装 pytorch-forecasting

或者，您可以通过conda安装该包：

conda install pytorch-forecasting pytorch -c pytorch>=1.7 -c conda-forge

PyTorch Forecasting现在从conda-forge频道安装，而PyTorch是从pytorch频道安装的。

要使用MQF2损失（多元分位数损失），还需要安装pip install pytorch-forecasting[mqf2]

文档

请访问https://pytorch-forecasting.readthedocs.io以阅读详细的教程。

可用模型

文档提供了可用模型的比较。

用于可解释多步时间序列预测的时序融合转换器，在基准测试中比Amazon的DeepAR性能高出36-69%。
N-BEATS：用于可解释时间序列预测的神经网络基础扩展分析，如果用作集成，在M4比赛中优于包括传统统计方法集成在内的所有其他方法。M4比赛可以说是单变量时间序列预测最重要的基准。
N-HiTS：用于时间序列预测的神经网络分层插值支持协变量，并且一直战胜N-BEATS。它也特别适合进行长周期预测。
DeepAR：使用自回归循环网络的概率预测是最受欢迎的预测算法之一，通常用作基线。
用于基线的简单标准网络：LSTM和GRU网络以及解码器上的MLP。
始终预测最新已知值的基线模型。

有关实现新模型或其他自定义组件的信息，请参阅如何实现新模型教程。它涵盖了基本和高级架构。

使用示例

网络可以使用PyTorch Lightning Trainer在pandas DataFrame上进行训练，这些DataFrame首先转换为TimeSeriesDataSet。

# imports for training
import lightning.pytorch as pl
from lightning.pytorch.loggers import TensorBoardLogger
from lightning.pytorch.callbacks import EarlyStopping, LearningRateMonitor
# import dataset, network to train and metric to optimize
from pytorch_forecasting import TimeSeriesDataSet, TemporalFusionTransformer, QuantileLoss
from lightning.pytorch.tuner import Tuner

# load data: this is pandas dataframe with at least a column for
# * the target (what you want to predict)
# * the timeseries ID (which should be a unique string to identify each timeseries)
# * the time of the observation (which should be a monotonically increasing integer)
data = ...

# define the dataset, i.e. add metadata to pandas dataframe for the model to understand it
max_encoder_length = 36
max_prediction_length = 6
training_cutoff = "YYYY-MM-DD"  # day for cutoff

training = TimeSeriesDataSet(
    data[lambda x: x.date <= training_cutoff],
    time_idx= ...,  # column name of time of observation
    target= ...,  # column name of target to predict
    group_ids=[ ... ],  # column name(s) for timeseries IDs
    max_encoder_length=max_encoder_length,  # how much history to use
    max_prediction_length=max_prediction_length,  # how far to predict into future
    # covariates static for a timeseries ID
    static_categoricals=[ ... ],
    static_reals=[ ... ],
    # covariates known and unknown in the future to inform prediction
    time_varying_known_categoricals=[ ... ],
    time_varying_known_reals=[ ... ],
    time_varying_unknown_categoricals=[ ... ],
    time_varying_unknown_reals=[ ... ],
)

# create validation dataset using the same normalization techniques as for the training dataset
validation = TimeSeriesDataSet.from_dataset(training, data, min_prediction_idx=training.index.time.max() + 1, stop_randomization=True)

# convert datasets to dataloaders for training
batch_size = 128
train_dataloader = training.to_dataloader(train=True, batch_size=batch_size, num_workers=2)
val_dataloader = validation.to_dataloader(train=False, batch_size=batch_size, num_workers=2)

# create PyTorch Lighning Trainer with early stopping
early_stop_callback = EarlyStopping(monitor="val_loss", min_delta=1e-4, patience=1, verbose=False, mode="min")
lr_logger = LearningRateMonitor()
trainer = pl.Trainer(
    max_epochs=100,
    accelerator="auto",  # run on CPU, if on multiple GPUs, use strategy="ddp"
    gradient_clip_val=0.1,
    limit_train_batches=30,  # 30 batches per epoch
    callbacks=[lr_logger, early_stop_callback],
    logger=TensorBoardLogger("lightning_logs")
)

# define network to train - the architecture is mostly inferred from the dataset, so that only a few hyperparameters have to be set by the user
tft = TemporalFusionTransformer.from_dataset(
    # dataset
    training,
    # architecture hyperparameters
    hidden_size=32,
    attention_head_size=1,
    dropout=0.1,
    hidden_continuous_size=16,
    # loss metric to optimize
    loss=QuantileLoss(),
    # logging frequency
    log_interval=2,
    # optimizer parameters
    learning_rate=0.03,
    reduce_on_plateau_patience=4
)
print(f"Number of parameters in network: {tft.size()/1e3:.1f}k")

# find the optimal learning rate
res = Tuner(trainer).lr_find(
    tft, train_dataloaders=train_dataloader, val_dataloaders=val_dataloader, early_stop_threshold=1000.0, max_lr=0.3,
)
# and plot the result - always visually confirm that the suggested learning rate makes sense
print(f"suggested learning rate: {res.suggestion()}")
fig = res.plot(show=True, suggest=True)
fig.show()

# fit the model on the data - redefine the model with the correct learning rate if necessary
trainer.fit(
    tft, train_dataloaders=train_dataloader, val_dataloaders=val_dataloader,
)

项目详情

这些详情尚未由 PyPI 验证

发布历史发布通知 | RSS源

本版本

1.1.1

2024年9月9日

1.1.0

2024年9月8日

1.0.0

2023年4月10日

0.10.3

2022年9月7日

0.10.2

2022年5月23日

0.10.1

2022年3月24日

0.10.0

2022年3月23日

0.9.2

2021年11月29日

0.9.1

2021年9月26日

0.9.0

2021年6月4日

0.8.5

2021年4月27日

0.8.4

2021年3月7日

0.8.3

2021年1月31日

0.8.2

2021年1月12日

0.8.1

2021年1月10日

0.8.0

2021年1月3日

0.7.1

2020年12月7日

0.7.0

2020年12月3日

0.6.1

2020年11月24日

0.6.0

2020年11月10日

0.5.3

2020年10月31日

0.5.2

2020年10月18日

0.5.1

2020年10月14日

0.5.0

2020年10月12日

0.4.1

2020年8月1日

0.4.0

2020年9月28日

0.3.1

2020年9月13日

0.3.0

2020年9月2日

0.2.4

2020年8月26日

0.2.3

2020年8月23日

0.2.2

2020年8月23日

0.2.1

2020年8月23日

0.2.0

2020年8月16日

0.1.2

2020年7月26日

0.1.1

2020年7月26日

0.1.0

2020年7月3日

下载文件

下载适用于您的平台文件。如果您不确定选择哪个，请了解有关安装包的更多信息。

源分布

pytorch_forecasting-1.1.1.tar.gz (152.7 kB 查看哈希值)

上传时间 2024年9月9日 源

构建分布

pytorch_forecasting-1.1.1-py3-none-any.whl (177.6 kB 查看哈希值)

上传时间 2024年9月9日 Python 3

pytorch_forecasting-1.1.1.tar.gz的哈希值

pytorch_forecasting-1.1.1.tar.gz的哈希值
算法	哈希摘要
SHA256	`41e5449c4abb54e54bd431408943f5be602b0d0e05973c04aef7d409062cdd78`
MD5	`2ce0227596efb5465b608f50baf8dd7a`
BLAKE2b-256	`b9bcc4210627dff83e31c3b6638e3853ab28c2b5cf09c957b08d07a420e0a9ff`

pytorch_forecasting-1.1.1-py3-none-any.whl的哈希值

pytorch_forecasting-1.1.1-py3-none-any.whl的哈希值
算法	哈希摘要
SHA256	`7af55159b2e6c74a0e5c377477b8028bca20db03f0a159719294a3293f9ecbfc`
MD5	`732e9d95ded5f4c1106e4c4bc0fc7d66`
BLAKE2b-256	`37abb5ef6ec672f9ad65f0e43447a758f22cf6a40ec5799587374921e6f6cc68`

pytorch-forecasting 1.1.1

导航

已验证详情

维护者

未验证详情

元数据

分类

项目描述

安装

文档

可用模型

使用示例

项目详情

已验证详情

维护者

未验证详情

元数据

分类

发布历史发布通知 | RSS源

下载文件

源分布

构建分布

pytorch-forecasting 1.1.1

导航

已验证详情

维护者

未验证详情

元数据

分类

项目描述

安装

文档

可用模型

使用示例

项目详情

已验证详情

维护者

未验证详情

元数据

分类

发布历史 发布通知 | RSS源

下载文件

源分布

构建分布

发布历史发布通知 | RSS源