CH 10: Advanced Deep Learning Methods for Time Series Analysis
Improvements to deep learning modules and training schemes
Neural network models for time series forecasting (taxonomy figure):
▪ Transformer-based: Informer, Autoformer, FEDformer, …; more recently PatchTST, CrossFormer, iTransformer
▪ CNN-based: TCN; more recently MICN, TimesNet, ModernTCN
▪ RNN-based: DeepAR, MQ-RNN; more recently SegRNN
▪ MLP-based: NHITS, Linear; more recently MTS-Mixers, TSMixer, DLinear, RLinear
Outline
1. Preprocessing for time series models
2. MLP models
3. RNN models
4. CNN models
5. Attention models
…
Outline (this part)
1. Linear models
2. Normalization
3. Channel Independence
DLinear
▪ Given a history of length $L$, predict the next $T$ steps
▪ History data: $\boldsymbol{Y}_{\mathrm{old}} \in \mathbb{R}^{L \times d}$
▪ Target data: $\boldsymbol{Y}_{\mathrm{new}} \in \mathbb{R}^{T \times d}$
▪ Model: $\boldsymbol{W} \in \mathbb{R}^{T \times L}$, with $\boldsymbol{Y}_{\mathrm{new}} = \boldsymbol{W}\,\boldsymbol{Y}_{\mathrm{old}}$
▪ LTSF-Linear
▪ DLinear: decompose the input into a Trend and a Remainder component, and apply a separate Linear layer to each
▪ NLinear: apply the Linear layer on top of the Naive1 baseline (subtract the last observed value from the input, then add it back to the output)
Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu. Are Transformers Effective for Time Series Forecasting? AAAI 2023.
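To make the DLinear idea concrete, here is a minimal PyTorch sketch (my own illustration, not the authors' released code; `seq_len`, `pred_len`, and the moving-average `kernel_size` are placeholder hyperparameters). It decomposes the input into a moving-average trend and a remainder, applies one linear map per component along the time axis, and sums the two forecasts.

```python
import torch
import torch.nn as nn

class DLinearSketch(nn.Module):
    """Illustrative DLinear-style model: trend/remainder decomposition + one linear map per component."""
    def __init__(self, seq_len: int, pred_len: int, kernel_size: int = 25):
        super().__init__()
        # Moving average over time extracts a smooth trend; padding keeps the sequence length unchanged.
        self.moving_avg = nn.AvgPool1d(kernel_size, stride=1, padding=kernel_size // 2,
                                       count_include_pad=False)
        self.linear_trend = nn.Linear(seq_len, pred_len)      # W_trend in R^{T x L}
        self.linear_remainder = nn.Linear(seq_len, pred_len)  # W_remainder in R^{T x L}

    def forward(self, x):                 # x: (batch, seq_len, d), channels last
        x = x.transpose(1, 2)             # (batch, d, seq_len): each channel handled independently
        trend = self.moving_avg(x)        # smooth trend component
        remainder = x - trend             # seasonal / residual component
        y = self.linear_trend(trend) + self.linear_remainder(remainder)
        return y.transpose(1, 2)          # (batch, pred_len, d)
```

NLinear would instead subtract the last observed value of each input window before a single linear layer and add it back to the prediction afterwards.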
Properties of The Linear Model
▪ The time-step-dependent linear model, despite its simplicity, proves to be highly effective in
modeling temporal patterns.
▪ Conversely, even though recurrent or attention architectures have high representational capacity,
achieving time-step independence is challenging for them. They usually overfit on the data
instead of solely considering the positions.
Si-An Chen et al. TSMixer: An all-MLP Architecture for Time Series Forecasting. TMLR (2023)
Properties of The Linear Model
▪ A single linear layer can also effectively learn periodic patterns
A linear mapping can predict periodic signals when the length of the input historical sequence
is not less than the period, although the learned solution is not unique.
Zhe Li et al. Revisiting Long-term Time Series Forecasting: An Investigation on Linear Mapping. CoRR abs/2305.10721 (2023)
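The claim is easy to verify numerically. The sketch below is my own toy experiment (the sine series, look-back length 48, and horizon 24 are assumptions): fitting a single linear map by least squares recovers the periodic signal essentially exactly once the look-back covers one full period.

```python
import numpy as np

# Synthetic periodic signal with period 24 (hypothetical example)
period, seq_len, pred_len = 24, 48, 24
t = np.arange(5000)
series = np.sin(2 * np.pi * t / period)

# Build (input window, target window) pairs
n = len(series) - seq_len - pred_len
X = np.stack([series[i:i + seq_len] for i in range(n)])
Y = np.stack([series[i + seq_len:i + seq_len + pred_len] for i in range(n)])

# One linear map W in R^{seq_len x pred_len}, fitted by ordinary least squares (no hidden layers)
W, *_ = np.linalg.lstsq(X, Y, rcond=None)
print("max abs forecast error:", np.abs(X @ W - Y).max())  # close to machine precision
```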
Properties of The Linear Model
▪ A single linear layer can also effectively learn periodic patterns
The linear model fits seasonality well but performs poorly on the trend.
Reversible Instance Normalization (RevIN)
The predictions of the baselines are inaccurately (a) shifted and (b) scaled
Taesung Kim et al. Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift. ICLR 2022
RevIN
The (a-3) non-stationary information includes statistical properties from the input data: mean $\mu$, variance $\sigma^2$, and learnable affine parameters $\gamma$, $\beta$. The normalization layer transforms the (b-1) original data distribution into a (b-2) mean-centered distribution, where the distribution discrepancy between different instances is reduced. Using $\hat{x}$, the model predicts the future values $\tilde{y}$ following the (b-3) distribution where non-stationary information is eliminated.
Taesung Kim et al. Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift. ICLR 2022
RevIN
▪ Let $K$, $T_x$ and $T_y$ denote the number of variables, the input sequence length, and the model prediction length; the model maps $x^{(i)} \in \mathbb{R}^{K \times T_x} \to y^{(i)} \in \mathbb{R}^{K \times T_y}$.
▪ For $x^{(i)}$,
$$\mathbb{E}_t\big[x^{(i)}_{kt}\big] = \frac{1}{T_x}\sum_{j=1}^{T_x} x^{(i)}_{kj} \quad\text{and}\quad \mathrm{Var}\big[x^{(i)}_{kt}\big] = \frac{1}{T_x}\sum_{j=1}^{T_x}\Big(x^{(i)}_{kj} - \mathbb{E}_t\big[x^{(i)}_{kt}\big]\Big)^2$$
▪ Normalize the input data $x^{(i)}$ as
$$\hat{x}^{(i)}_{kt} = \gamma_k\,\frac{x^{(i)}_{kt} - \mathbb{E}_t\big[x^{(i)}_{kt}\big]}{\sqrt{\mathrm{Var}\big[x^{(i)}_{kt}\big] + \epsilon}} + \beta_k$$
▪ Denormalize the model output $\tilde{y}^{(i)}$:
$$\hat{y}^{(i)}_{kt} = \sqrt{\mathrm{Var}\big[x^{(i)}_{kt}\big] + \epsilon}\cdot\frac{\tilde{y}^{(i)}_{kt} - \beta_k}{\gamma_k} + \mathbb{E}_t\big[x^{(i)}_{kt}\big]$$
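The formulas above translate almost line by line into code. A minimal RevIN-style module might look as follows (a sketch based on the equations, not the authors' released implementation; input shape (batch, K, T_x) is assumed):

```python
import torch
import torch.nn as nn

class RevINSketch(nn.Module):
    """Instance-wise normalization with learnable affine parameters, reversible at the output."""
    def __init__(self, num_variables: int, eps: float = 1e-5):
        super().__init__()
        self.eps = eps
        self.gamma = nn.Parameter(torch.ones(num_variables, 1))   # gamma_k
        self.beta = nn.Parameter(torch.zeros(num_variables, 1))   # beta_k

    def normalize(self, x):              # x: (batch, K, T_x)
        self.mean = x.mean(dim=-1, keepdim=True)                                   # E_t[x_k]
        self.std = torch.sqrt(x.var(dim=-1, keepdim=True, unbiased=False) + self.eps)
        return self.gamma * (x - self.mean) / self.std + self.beta

    def denormalize(self, y):            # y: (batch, K, T_y), the model output
        return self.std * (y - self.beta) / self.gamma + self.mean
```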
Slice-level Adaptive Normalization (SAN)
Zhiding Liu et al. Adaptive Normalization for Non-stationary Time Series Forecasting: A Temporal Slice Perspective. NeurIPS 2023
Slice-level Adaptive Normalization (SAN)
$$\hat{\boldsymbol{\sigma}}_i = \mathrm{MLP}(\boldsymbol{\sigma}_i, \bar{\boldsymbol{x}}_i)$$
▪ Directly applying normalization to input data may erase this statistical information and lead
to poor predictions;
▪ It is challenging to fit trend changes solely using a linear layer. Applying batch normalization
even induces worse results. Disentangling the simulated time series also does not work.
Zhe Li et al. Revisiting Long-term Time Series Forecasting: An Investigation on Linear Mapping. CoRR abs/2305.10721 (2023)
RevIN and Linear Classifier
▪ For the seasonal signal, RevIN scales the range but does not change the periodicity.
▪ For the trend signal, RevIN scales each segment into the same range and exhibits periodic
patterns. RevIN is capable of turning some trends into seasonality, making models better learn or
memorize trend terms.
RevIN and Linear Classifier
▪ RevIN converts continuously changing trends into multiple segments with a fixed and
similar trend, demonstrating periodic characteristics.
▪ As a result, errors in trend prediction caused by accumulated timesteps in the past can be
alleviated, leading to more accurate forecasting results.
Channel Independent
Lu Han, Han-Jia Ye, De-Chuan Zhan. The Capacity and Robustness Trade-off: Revisiting the Channel Independent Strategy for
Multivariate Time Series Forecasting. CoRR abs/2304.05206 (2023)
MAE Comparison
The Framework
▪ Normalization
▪ Temporal Module
▪ Even a randomly initialized temporal feature extractor with untrained parameters can yield
competitive or even better forecasting results.
RLinear
▪ RLinear: RevIN + Linear + CI (channel independence)
Zhe Li et al. Revisiting Long-term Time Series Forecasting: An Investigation on Linear Mapping. CoRR abs/2305.10721 (2023)
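Combining the pieces, a channel-independent linear forecaster with reversible instance normalization might be sketched as below (my illustration of the RLinear recipe; it reuses the hypothetical `RevINSketch` module from the earlier sketch, and all sizes are placeholders):

```python
import torch.nn as nn

class RLinearSketch(nn.Module):
    """RevIN + one linear map over time, shared across channels (channel independence)."""
    def __init__(self, num_variables: int, seq_len: int, pred_len: int):
        super().__init__()
        self.revin = RevINSketch(num_variables)      # normalization module from the earlier sketch
        self.linear = nn.Linear(seq_len, pred_len)   # one weight matrix shared by all channels

    def forward(self, x):                            # x: (batch, K, seq_len)
        x = self.revin.normalize(x)
        y = self.linear(x)                           # applied per channel along the time axis
        return self.revin.denormalize(y)             # (batch, K, pred_len)
```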
TSMixer
▪ The time-mixing MLPs are shared across all features and the feature-mixing MLPs are shared
across all of the time steps.
Si-An Chen et al. TSMixer: An all-MLP Architecture for Time Series Forecasting. TMLR (2023)
▪ Time-mixing MLP
▪ Feature-mixing MLP
▪ Temporal Projection
▪ Residual Connections
▪ Normalization
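A simplified mixing block in this spirit is sketched below (my reading of the description above, not the reference TSMixer implementation; dropout and some normalization details are omitted). The time-mixing MLP is shared across features and the feature-mixing MLP is shared across time steps, each with a residual connection.

```python
import torch
import torch.nn as nn

class MixerBlockSketch(nn.Module):
    """One TSMixer-style block: time-mixing MLP shared across features, feature-mixing MLP shared across time."""
    def __init__(self, seq_len: int, num_features: int, hidden: int = 64):
        super().__init__()
        self.time_norm = nn.LayerNorm(num_features)
        self.time_mlp = nn.Sequential(nn.Linear(seq_len, seq_len), nn.ReLU())
        self.feat_norm = nn.LayerNorm(num_features)
        self.feat_mlp = nn.Sequential(nn.Linear(num_features, hidden), nn.ReLU(),
                                      nn.Linear(hidden, num_features))

    def forward(self, x):                       # x: (batch, seq_len, num_features)
        # Time mixing: the same MLP acts on every feature's time axis (residual connection).
        h = self.time_mlp(self.time_norm(x).transpose(1, 2)).transpose(1, 2)
        x = x + h
        # Feature mixing: the same MLP acts on every time step's feature vector (residual connection).
        return x + self.feat_mlp(self.feat_norm(x))
```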
PatchTST
▪ Multivariate time series data is divided into different channels. They share the same
Transformer backbone, but the forward processes are independent
Forward Process. Denote the $i$-th univariate series of length $L$ starting at time index 1 as $\boldsymbol{x}^{(i)}_{1:L} = (x^{(i)}_1, \ldots, x^{(i)}_L)$, where $i = 1, \ldots, M$. The input $(\boldsymbol{x}_1, \ldots, \boldsymbol{x}_L)$ is split into $M$ univariate series $\boldsymbol{x}^{(i)} \in \mathbb{R}^{1 \times L}$, each of which is fed independently into the Transformer backbone. The Transformer backbone then provides the prediction results $\hat{\boldsymbol{x}}^{(i)} = (\hat{x}^{(i)}_{L+1}, \ldots, \hat{x}^{(i)}_{L+T}) \in \mathbb{R}^{1 \times T}$.
Yuqi Nie et al. A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. ICLR 2023
PatchTST
Patching. Each input univariate time series $\boldsymbol{x}^{(i)}$ is first divided into patches which can be either overlapped or non-overlapped. Denote the patch length as $P$ and the stride as $S$; the patching process then generates a sequence of patches $\boldsymbol{x}^{(i)}_p \in \mathbb{R}^{P \times N}$, where $N$ is the number of patches. With the use of patches, the number of input tokens is reduced from $L$ to approximately $L/S$.
Positional Encoding. A learnable additive positional encoding $\boldsymbol{W}_{\mathrm{pos}} \in \mathbb{R}^{D \times N}$ is applied to monitor the temporal order of the patches.
Yuqi Nie et al. A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. ICLR 2023
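A minimal sketch of the patching and positional-encoding steps (an illustration under assumed shapes, not the official PatchTST code; the padding step that appends a repeated last patch is omitted):

```python
import torch
import torch.nn as nn

class PatchEmbedSketch(nn.Module):
    """Split each univariate series into patches, project to d_model, and add learnable positions."""
    def __init__(self, seq_len: int, patch_len: int, stride: int, d_model: int):
        super().__init__()
        self.patch_len, self.stride = patch_len, stride
        num_patches = (seq_len - patch_len) // stride + 1
        self.proj = nn.Linear(patch_len, d_model)                   # patch -> token embedding
        self.pos = nn.Parameter(torch.zeros(num_patches, d_model))  # learnable W_pos

    def forward(self, x):                       # x: (batch, 1, seq_len), one channel at a time (CI)
        patches = x.unfold(2, self.patch_len, self.stride)          # (batch, 1, N, P)
        tokens = self.proj(patches.squeeze(1))                      # (batch, N, d_model)
        return tokens + self.pos                                    # ready for the Transformer backbone
```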
PatchTST
PatchTST
Yunhao Zhang, Junchi Yan. Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting. ICLR 2023
CrossFormer
Two-Stage Attention. Directly using MSA in the Cross-Dimension Stage to build the D-to-D connection results in $O(D^2)$ complexity.
Router mechanism: a small, fixed number $c$ of "routers" first gather information from all $D$ dimensions and then distribute the gathered information back to them. The complexity is reduced to $O(2cD) = O(D)$.
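A rough sketch of the router mechanism (my simplification; `num_routers` plays the role of the small constant $c$, and all sizes are placeholders): the routers first attend to all $D$ dimension tokens to gather information, then each dimension token attends to the routers, so no D-to-D attention is ever computed.

```python
import torch
import torch.nn as nn

class RouterAttentionSketch(nn.Module):
    """Two-step cross-dimension attention via a small set of routers: O(c*D) instead of O(D^2)."""
    def __init__(self, d_model: int, num_routers: int = 8, num_heads: int = 4):
        super().__init__()
        self.routers = nn.Parameter(torch.randn(num_routers, d_model))
        self.gather = nn.MultiheadAttention(d_model, num_heads, batch_first=True)
        self.scatter = nn.MultiheadAttention(d_model, num_heads, batch_first=True)

    def forward(self, dim_tokens):                      # dim_tokens: (batch, D, d_model)
        routers = self.routers.unsqueeze(0).expand(dim_tokens.size(0), -1, -1)
        # Step 1: the c routers gather information from all D dimension tokens.
        gathered, _ = self.gather(routers, dim_tokens, dim_tokens)
        # Step 2: each dimension token queries the routers to receive the distributed information.
        out, _ = self.scatter(dim_tokens, gathered, gathered)
        return out
```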
iTransformer
Yong Liu et al. iTransformer: Inverted Transformers Are Effective for Time Series Forecasting. CoRR abs/2310.06625 (2023)
iTransformer
▪ Transformer treats a time series like natural language, but the time-aligned embedding (one token per time step across all variates) may bring risks for multivariate series. The problem can be alleviated by expanding the receptive field.
▪ Although patching can be more fine-grained, it also brings higher computational complexity and potential interaction noise between time-unaligned patches.
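The inversion can be sketched as follows (my illustration, not the released iTransformer code; `d_model` and the layer count are placeholders): each variate's whole length-$L$ history is embedded as one token, a standard Transformer encoder models interactions across variates, and a linear head maps each token to its length-$T$ forecast.

```python
import torch
import torch.nn as nn

class InvertedTransformerSketch(nn.Module):
    """Tokens are variates: embed each channel's length-L history, attend across channels, project to length T."""
    def __init__(self, seq_len: int, pred_len: int, d_model: int = 128, num_layers: int = 2):
        super().__init__()
        self.embed = nn.Linear(seq_len, d_model)        # whole series of one variate -> one token
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        self.head = nn.Linear(d_model, pred_len)

    def forward(self, x):                               # x: (batch, seq_len, D)
        tokens = self.embed(x.transpose(1, 2))          # (batch, D, d_model): one token per variate
        tokens = self.encoder(tokens)                   # attention mixes information across variates
        return self.head(tokens).transpose(1, 2)        # (batch, pred_len, D)
```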
iTransformer
Cristian Challu et al. NHITS: Neural Hierarchical Interpolation for Time Series Forecasting. AAAI 2023: 6989-6997
iTransformer
iTransformer
Modern TCN
ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis. ICLR 2024.
SegRNN
Shengsheng Lin et al. SegRNN: Segment Recurrent Neural Network for Long-Term Time Series Forecasting. CoRR abs/2308.11200 (2023)