Monash Forecasting Repository

Our Aim

Our aim is to introduce the first comprehensive time series forecasting repository containing datasets of related time series to facilitate the evaluation of global forecasting models. All datasets are intended to use only for research purpose. Our repository contains 30 datasets including both publicly available time series datasets (in different formats) and datasets curated by us. Many datasets have different versions based on the frequency and the inclusion of missing values, making the total number of dataset variations to 58. Furthermore, it includes both real-world and competition time series datasets covering varied domains.

We have also characterised each dataset and executed several baseline methods with them.

We recommend you to read our paper for a detailed discussion of the datasets, their original sources, feature analysis and baseline evaluation. If you use our work, please cite the paper "Rakshitha Godahewa, Christoph Bergmeir, Geoffrey I. Webb, Rob J. Hyndman, Pablo Montero-Manso, Monash Time Series Forecasting Archive".

            @InProceedings{godahewa2021monash,
              author = "Godahewa, Rakshitha and Bergmeir, Christoph and Webb, Geoffrey I. and Hyndman, Rob J. and Montero-Manso, Pablo",
              title = "Monash Time Series Forecasting Archive",
              booktitle = "Neural Information Processing Systems Track on Datasets and Benchmarks",
              year = "2021"
            }

Datasets

The following table shows a list of time series datasets that are currently available in our archive. The datasets are available in .tsf format which is a new format we propose to store time series data pioneered by sktime .ts format. The wrappers to load data into R and Python environments are available in our github repository.

Dataset	Domain	No: of Series	Min. Length	Max. Length	Competition	Multivariate	Download	Source
M1	Multiple	1001	15	150	Yes	No	Yearly Quarterly Monthly	Makridakis et al., 1982
M3	Multiple	3003	20	144	Yes	No	Yearly Quarterly Monthly Other	Makridakis and Hibon, 2000
M4	Multiple	100000	19	9933	Yes	No	Yearly Quarterly Monthly Weekly Daily Hourly	Makridakis et al., 2020
Tourism	Tourism	1311	11	333	Yes	No	Yearly Quarterly Monthly	Athanasopoulos et al., 2011
CIF 2016	Banking	72	34	120	Yes	No	Monthly	Stepnicka and Burda, 2017
London Smart Meters	Energy	5560	288	39648	No	No	W Missing W/O Missing	Jean-Michel, 2019
Aus. Electricity Demand	Energy	5	230736	232272	No	No	Half Hourly	Curated by us
Wind Farms	Energy	339	6345	527040	No	No	W Missing W/O Missing	Curated by us
Dominick	Sales	115704	28	393	No	No	Weekly	James M. Kilts Center, 2020
Bitcoin	Economic	18	2659	4581	No	No	W Missing W/O Missing	Curated by us
Pedestrian Counts	Transport	66	576	96424	No	No	Hourly	City of Melbourne, 2020
Vehicle Trips	Transport	329	70	243	No	No	W Missing W/O Missing	fivethirtyeight, 2015
KDD Cup 2018	Nature	270	9504	10920	Yes	No	W Missing W/O Missing	KDD Cup, 2018
Weather	Nature	3010	1332	65981	No	No	Daily	Sparks et al., 2020
NN5	Banking	111	791	791	Yes	Yes	Daily W Missing Daily W/O Missing Weekly	Ben Taieb et al., 2012
Web Traffic	Web	145063	803	803	Yes	Yes	Daily W Missing Daily W/O Missing Weekly	Google, 2017
Solar	Energy	137	52560	52560	No	Yes	10 Minutes Weekly	Solar, 2020
Electricity	Energy	321	26304	26304	No	Yes	Hourly Weekly	UCI, 2020
Car Parts	Sales	2674	51	51	No	Yes	W Missing W/O Missing	Hyndman, 2015
FRED-MD	Economic	107	728	728	No	Yes	Monthly	McCracken and Ng, 2016
San Francisco Traffic	Transport	862	17544	17544	No	Yes	Hourly Weekly	Caltrans, 2020
Rideshare	Transport	2304	541	541	No	Yes	W Missing W/O Missing	Curated by us
Hospital	Health	767	84	84	No	Yes	Monthly	Hyndman, 2015
COVID Deaths	Nature	266	212	212	No	Yes	Daily	Johns Hopkins University, 2020
Temperature Rain	Nature	32072	725	725	No	Yes	W Missing W/O Missing	Curated by us
Sunspot	Nature	1	73931	73931	No	No	W Missing W/O Missing	Sunspot, 2015
Saugeen River Flow	Nature	1	23741	23741	No	No	Daily	McLeod and Gweon, 2013
US Births	Nature	1	7305	7305	No	No	Daily	Pruim et al., 2020
Solar Power	Energy	1	7397222	7397222	No	No	4 Seconds	Curated by us
Wind Power	Energy	1	7397147	7397147	No	No	4 Seconds	Curated by us

Results

In our paper, we have evaluated the performance of 13 baseline forecasting methods across the datasets in our repository. The baseline methods include 6 traditional univariate forecasting models: Simple Exponential Smoothing (SES), Theta (Assimakopoulos and Nikolopoulos, 2000), Exponential Smoothing (ETS, Hyndman, 2008), Auto-Regressive Integrated Moving Average (ARIMA, Box and Jenkins, 1990), Trigonometric Box-Cox ARMA Trend Seasonal (TBATS, Livera et al., 2011) and Dynamic Harmonic Regression ARIMA (DHR-ARIMA, Hyndman and Athanasopoulos, 2021), and 7 global forecasting models: Pooled Regression (PR, Trapero et al., 2015), CatBoost (Prokhorenkova et al., 2018), Feed-Forward Neural Network (FFNN, Goodfellow et al., 2016), DeepAR (Salinas et al., 2020), N-BEATS (Oreshkin et al., 2019), WaveNet (Borovykh et al., 2017) and Transformer (Vaswani et al., 2017). Later, we have executed Prophet model as an additional baseline across all datasets in our repository. Furthermore, we have shown the results we obtained from the Informer model (Zhou et al, 2021) for some of the datasets. For evaluation, we use 4 error metrics namely the symmetric Mean Absolute Percentage Error (sMAPE), Mean Absolute Scaled Error (MASE, Hyndman and Koehler, 2006), Mean Absolute Error (MAE, Sammut and Webb, 2010), and Root Mean Squared Error (RMSE). The sMAPE error measure was calculated in 2 ways: the original version and the modified version suggested by Suilin (2017). For more details of these baseline methods and error metrics, please refer to our paper.

The results table shows the results of mean MASE of each baseline method across the datasets in our repository. The best model across each dataset is highlighted in boldface. We use 2 versions of ARIMA. The results of the general ARIMA method are reported for yearly, quarterly, monthly, and daily datasets whereas the results of DHR-ARIMA are reported for weekly datasets and multi-seasonal datasets such as 10 minutely, half hourly, and hourly.

The results of all error metrics across all baselines except the Prophet and Informer models are available in the online appendix. The results of all error metrics across the Prophet model are available here. The results of all error metrics across the Informer model, the reasons for selecting the datasets for the Informer model experiments and the considered Informer model configurations are available here.

In November 2024, we have updated the repository with benchmark runs of many more deep-learning methods, namely Autoformer, DLinear, NLinear, NBEATS, N-HITS, TiDE, PatchTST, and TimesFM. A report detailing these new runs is here.

We also expect to run new baselines in the future and the results tables will be updated accordingly. As new forecasting models emerge rapidly, we also provide a simple interface for you to implement other statistical, machine learning and deep learning baselines. Our github repository contains detailed instructions and example code snippets explaining how to integrate new forecasting models to our framework. The results of the newly integrated forecasting models are also evaluated in the same way as our baselines using the same evaluation metrics and thus, the results of new forecasting models and our baselines are directly comparable. After integrating the new forecasting models, you can send us a pull-request on github to officially integrate your implementations to our framework. You are also invited to send us the results of your new forecasting models. If computationally feasible, we expect to re-execute the models and confirm the results. In the future, we expect to maintain two results tables here with the confirmed and unconfirmed results of the forecasting models.

A spreadsheet with all results and also some other error measures is here.

Dataset	SES	Theta	TBATS	ETS	(DHR-) ARIMA	PR	Cat- Boost	FFNN	DeepAR	N-BEATS	WaveNet	Trans- former	Prophet	Informer^*	Auto- former	DLinear ind.	DLinear mvar	N-BEATS global	N-HITS global	NLinear ind.	NLinear m.var.	PatchTST ind.	PatchTST m.var.	TiDE global	TimesFM(ZS) found.
Aus. Elecdemand	1.857	1.867	1.174	5.663	2.574	0.78	0.705	1.222	1.591	1.014	1.102	1.113	1.414	-	-	-	-	1.094	1.143	-	-	-	-	1.217	2.109
Bitcoin	4.327	4.344	4.611	2.718	4.03	2.664	2.888	6.006	6.394	7.254	5.315	8.462	11.089	-	-	-	-	6.408	6.066	-	-	-	-	-	5.366
Carparts	0.897	0.914	0.998	0.925	0.926	0.755	0.853	0.747	0.747	2.836	0.754	0.746	0.876	-	1.247	1.403	0.752	0.753	0.748	1.398	1.045	1.142	1.075	0.747	0.852
CIF 2016	1.291	0.997	0.861	0.841	0.929	1.019	1.175	1.053	1.159	0.971	1.8	1.173	1.029	-	-	-	-	2.261	1.622	-	-	-	-	2.352	1.261
COVID	7.776	7.793	5.719	5.326	6.117	8.731	8.241	5.459	6.895	5.858	7.835	8.941	12.77	-	7.221	9.2	5.601	7.911	7.362	6.287	5.176	8.918	8.111	9.069	8.794
Dominick	0.582	0.61	0.722	0.595	0.796	0.98	1.038	0.614	0.54	0.952	0.531	0.531	0.827	-	-	-	-	0.511	0.507	-	-	-	-	0.55	0.528
Electricity Hourly	4.544	4.545	3.69	6.501	4.602	2.912	2.262	3.2	2.516	1.968	1.606	2.522	2.05	2.682	2.4	1.883	1.88	2.005	2.073	1.891	1.882	2.131	2.138	2.127	2.265
Electricity Weekly	1.536	1.476	0.792	1.526	0.878	0.916	0.815	0.769	1.005	0.8	1.25	1.77	0.924	1.444	0.929	1.096	0.78	0.884	0.986	1.001	0.792	0.996	0.846	0.811	0.924
FRED-MD	0.617	0.698	0.502	0.468	0.533	8.827	0.947	0.601	0.64	0.604	0.806	1.823	1.843	17.839	0.621	0.548	0.627	0.666	0.66	0.499	0.843	0.47	0.573	0.901	0.635
Hospital	0.813	0.761	0.768	0.765	0.787	0.782	0.798	0.84	0.769	0.791	0.779	1.031	0.673	1.221	0.93	0.931	0.8	0.786	0.802	0.857	0.803	0.855	0.809	0.85	0.771
Kaggle Weekly	0.698	0.694	0.622	0.77	0.815	1.021	1.928	0.689	0.758	0.667	0.628	0.888	1.196	-	-	-	-	0.613	0.582	-	-	-	-	0.72	0.602
KDD	1.645	1.646	1.394	1.787	1.982	1.265	1.233	1.228	1.699	1.6	1.185	1.696	1.186	-	-	-	-	1.252	1.194	-	-	-	-	1.304	1.361
M1 Monthly	1.379	1.091	1.118	1.074	1.164	1.123	1.209	1.205	1.192	1.168	1.2	2.191	1.712	-	-	-	-	1.253	1.257	-	-	-	-	1.501	1.223
M1 Quarterly	1.929	1.702	1.694	1.658	1.787	1.892	2.031	1.862	1.833	1.788	1.7	2.772	2.136	-	-	-	-	2.543	2.369	-	-	-	-	2.777	1.882
M1 Yearly	4.938	4.191	3.499	3.771	4.479	4.588	4.427	4.355	4.603	4.384	4.666	5.519	5.633	-	-	-	-	5.525	4.84	-	-	-	-	6.258	5.257
M3 Monthly	1.091	0.864	0.861	0.865	0.873	1.01	1.065	1.011	1.167	0.934	1.008	1.454	1.375	-	-	-	-	0.916	0.906	-	-	-	-	1.066	0.983
M3 Other	3.089	2.271	1.848	1.814	1.831	2.655	3.178	2.615	2.975	2.39	2.127	2.781	4.694	-	-	-	-	-	-	-	-	-	-	-	-
M3 Quarterly	1.417	1.117	1.256	1.17	1.24	1.248	1.441	1.329	1.31	1.182	1.29	2.452	1.672	-	-	-	-	1.134	1.136	-	-	-	-	1.522	1.29
M3 Yearly	3.167	2.774	3.127	2.86	3.417	3.223	3.788	3.399	3.508	2.961	3.014	3.003	4.152	-	-	-	-	3.024	2.824	-	-	-	-	3.272	3.428
M4 Daily	1.154	1.153	1.157	1.239	1.179	1.162	1.593	1.141	2.212	1.218	1.157	1.377	3.698	-	-	-	-	1.125	1.142	-	-	-	-	1.324	1.328
M4 Hourly	11.607	11.524	2.663	26.69	13.557	1.662	1.771	2.862	2.145	2.247	1.68	8.84	1.776	-	-	-	-	2.756	3.781	-	-	-	-	3.081	0.812
M4 Monthly	1.15	0.97	1.053	0.948	0.962	1.08	1.093	1.151	1.163	1.026	1.16	2.125	1.367	-	-	-	-	0.97	0.987	-	-	-	-	1.122	1.063
M4 Quarterly	1.417	1.231	1.186	1.161	1.228	1.316	1.338	1.42	1.274	1.239	1.242	1.52	1.758	-	-	-	-	1.209	1.226	-	-	-	-	1.469	1.37
M4 Weekly	0.587	0.546	0.504	0.575	0.55	0.481	0.615	0.545	0.586	0.453	0.587	0.695	1.049	-	-	-	-	0.484	0.473	-	-	-	-	0.56	0.463
NA	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	3.43	3.344	-	-	-	-	3.698	-
NN5 Daily	1.521	0.885	0.858	0.865	1.013	1.263	0.973	0.941	0.919	1.134	0.916	0.958	0.883	0.933	1.087	0.96	0.961	0.901	0.908	0.957	0.957	1.156	1.017	0.962	1.483
NN5 Weekly	0.903	0.885	0.872	0.911	0.887	0.854	0.853	0.85	0.863	0.808	1.123	1.141	0.927	1.079	1.032	1.126	0.879	0.824	0.864	1.098	0.847	1.011	0.961	0.848	0.857
Pedestrians	0.957	0.958	1.297	1.19	3.947	0.256	0.262	0.267	0.272	0.38	0.247	0.274	2.034	-	-	-	-	0.245	0.241	-	-	-	-	0.245	0.257
Rideshare	3.014	3.641	3.067	4.04	1.53	3.019	2.908	4.198	4.029	3.877	3.009	4.04	4.666	-	7.68	4.58	4.314	4.108	4.041	4.485	4.418	3.617	3.753	4.164	2.985
Saugeen	1.426	1.425	1.477	2.036	1.485	1.674	1.411	1.524	1.56	1.852	1.471	1.861	1.51	1.896	1.546	1.602	1.602	1.587	1.668	1.443	1.443	1.492	1.492	1.536	2.225
Solar 10 Mins	1.451	1.452	3.936	1.451	1.034	1.451	2.504	1.45	1.45	1.573	-	1.451	1.821	1.614	-	1.793	1.611	2.379	2.406	1.781	1.771	1.439	1.437	1.473	1.456
Solar Weekly	1.215	1.224	0.916	1.134	0.848	1.053	1.53	1.045	0.725	1.184	1.961	0.574	1.508	2.408	1.767	1.102	1.157	1.306	1.384	1.312	1.248	-	-	1.099	0.97
Sunspot	0.128	0.128	0.067	0.128	0.067	0.099	0.059	0.207	0.02	0.375	0.004	0.003	0.852	0.504	0.133	0.118	0.118	0.192	0.166	0.083	0.083	0.119	0.119	0.174	0.272
Temp. Rain	1.347	1.368	1.227	1.401	1.174	0.876	1.028	0.847	0.785	1.3	0.786	0.687	1.15	-	-	-	1.093	0.791	0.795	-	1.725	-	0.861	1.201	1.05
Tourism Monthly	3.306	1.649	1.751	1.526	1.589	1.678	1.699	1.582	1.409	1.574	1.482	1.571	2.008	-	-	-	-	1.465	1.476	-	-	-	-	1.705	2.273
Tourism Quarterly	3.21	1.661	1.835	1.592	1.782	1.643	1.793	1.678	1.597	1.475	1.714	1.859	2.153	-	-	-	-	1.57	1.535	-	-	-	-	2.844	2.193
Tourism Yearly	3.253	3.015	3.685	3.395	3.775	3.516	3.553	3.401	3.205	2.977	3.624	3.552	2.59	-	-	-	-	109.908	3.195	-	-	-	-	5.358	3.259
Traffic Hourly	1.922	1.922	2.482	2.294	2.535	1.281	1.571	0.892	0.825	1.1	1.066	0.821	1.316	1.439	-	0.963	0.923	0.873	0.85	0.922	0.918	0.964	0.896	0.924	0.799
Traffic Weekly	1.116	1.121	1.148	1.125	1.191	1.122	1.116	1.15	1.182	1.094	1.233	1.555	1.084	1.323	1.476	1.487	1.096	1.117	1.083	1.357	1.103	1.266	1.168	1.124	1.04
US Births	4.343	2.138	1.453	1.529	1.917	2.094	1.609	2.032	1.548	1.537	1.837	1.65	5.626	2.22	1.688	2.166	2.166	1.438	1.51	2.145	2.145	2.607	2.607	2.723	4.475
Vehicle Trips	1.224	1.244	1.86	1.305	1.282	1.212	1.176	1.843	1.929	2.143	1.851	2.532	2.428	-	-	-	-	1.747	1.728	-	-	-	-	2.015	2.164
Weather	0.677	0.749	0.689	0.702	0.746	3.046	0.762	0.638	0.631	0.717	0.721	0.65	0.88	-	-	-	-	0.625	0.621	-	-	-	-	0.648	0.621

^*The results of the Informer model are only recorded for the datasets with equal-length series. For the datasets with unequal-length series, the Informer model is required to be executed per each series where the execution time is considerably high (for details, see here). Furthermore, the intermittent datasets such as Carparts, Rideshare, Web Traffic, Covid Deaths and Temperature Rain are not considered for the Informer experiments.

About Us

Team members

We are a group of time series researchers from Monash University and University of Sydney:

Rakshitha Godahewa

Christoph Bergmeir

Geoff Webb

Rob Hyndman

Pablo Montero-Manso

Contributors

The following people have contributed to our repository:

Gareth Davies

Contribute to Our Repository

We encourage other researchers to contribute time series datasets or benchmarking results to our repository either by directly uploading the datasets into our repository and/or by contacting us via email. You will then be listed as a contributor, and in the acknowledgements section.

If there are any copyright issues of the datasets, please contact us via email.

Acknowledgement

We are very grateful to the Department of Data Science and Artificial Intelligence of Monash University for their sponsorship.

Gareth Davies from Neural Aspect has contributed benchmark runs of many of the deep-learning methods, namely Autoformer, DLinear, NLinear, NBEATS, N-HITS, TiDE, PatchTST, and TimesFM.

Monash Time Series Forecasting Repository

Our Aim

Datasets

Results

Important Links

Paper

Features

Online Appendix

GitHub

About Us

Team members

Contributors

Contribute to Our Repository

Acknowledgement