Back to Blog

Improve your ML with EQ

Hugo Birkelund
Archived blog post. This blog post has been transferred from our previous blogging platform. Links and images may not work as intended.

Machine learning (ML) and artificial intelligence (AI) present some fascinating modeling approaches. However, even with ML and hyper-fast computing, the old saying "garbage in, garbage out" or simply GIGO holds true. EQ provides structured and curated data for modeling. Our database is a treasure trove for model developers.

EQ = self-service

Help yourself to data. EQ’s business is to provide the best possible curated historical data and aligned forecasts, empowering you to excel in your job with minimum waste of time and energy. If you don't already have an account, get your access here.

Looking for something that is not there

Throwing in more data will not help you (much) if what you are looking for is not there. I have tried to illustrate the problem with two pictures. In the left picture, I have simply removed information and added noise, analog to a what you might see in a dataset.

 

Thanks to Fiona for guarding my Parrot Zik headset.

You can easily observe the problem in a picture. However, in data analysis, you can, at best, anticipate that something is missing. It could be too big times steps, poor spatial resolution, misaligned input used for model fitting and forecasting, or good old fashioned out-layers and missing values. More data might add to the problem and certainly waste your time.

A checklist

A checklist before you hit enter "estimate model" might save you a lot of time:

  1. Check your input data. At EQ, we do this as a profession. For instance, we provide curated synthetic data which we believe is a far superior alternative to actuals generated by TSOs.
  2. Avoid settling for raw weather data. Failing to align the output generated by weather models with real-life observations creates nasty errors and gut-wrenching frustrations. I know, from personal experience.
  3. Consider adding contextual information. Capacity changes, holiday effects, REMIT data, and more input add depth to the analysis. Failing to add context might contaminate your model and drain it for explanatory power.
  4. Look out for "known" non-linearity. A simple trick is, e.g. to use transformed data. Regardless of how advanced your model is, this simple trick might limit the data requirement tremendously, and improve the forecasts properties.

Take vitamins, not medicine

EQ provides a vast set of curated historical weather-driven fundamentals and indices. We believe they are perfect input for modeling and forecasting. Further, we provide all data in 15 minutes resolution, which is likely to become the power market’s preferred resolution. Using the correct data granularity from the get-go hugely simplifies modeling.

Contact me to discuss the best solutions for your team. We may have your data requirement covered, faster and simpler than you can imagine.

...I almost forgot

You do not have to download EQ’s data. Connect your developer tool or models directly into EQ’s timeseries database. It saves you time and simplifies your life.

EQ_Cut_Down_on_Data_managment_2

EQ generates approximately 4,5 million data points per price areas every day. It makes sense to leave the data in our custody.

More from the Blog

Could France become a net-importer with nuclear power at it’s lowest since 1991?

Eylert Ellefsen
Eylert Ellefsen

The available nuclear capacity in France was very low at the end of Q4 last year, whilst outlooks for Q1-2022 were also lower than normal by the end of 2021. Around this time, we published a blog post focusing on the low availability during February, which could mean a strained supply situation in a cold-weather scenario. This was reflected in very strong forward prices for Q1-months during December as markets included strong risk premiums in case of a cold spell.

Read Story

Nuclear + Coal Exits in Germany – Status Update January 2022

Eylert Ellefsen
Eylert Ellefsen

The German nuclear and coal exits since 2020 have had a strong influence on the country’s production fleet. In order to identify and illustrate the changes in power generation sources within the last 2 years, EQ has studied the German January power balances for 2020, 2021, and 2022. Initially, we have updated both the exit and capacity plans for the nuclear, as well as the coal-fleet as an introduction to the power balance and production numbers.

Read Story

Hydropower Review Alp Region 2021 and Outlooks for 2022

Eylert Ellefsen
Eylert Ellefsen

Electricity generated by hydropower across the Alp region (France, Italy, Austria, Switzerland, and Germany) has an average annual net production of about 187 TWh - covering approximately 14% of total consumption in the area. Of these countries, France has the highest hydropower output at 59 TWh net, while Switzerland and Austria at about 35 TWh are what we would consider as the most typical hydropower areas – which cover about 50% of their consumption from hydropower. This blog post from EQ contains a short review of the hydropower conditions for 2021, as well as a view of the current situation and hydrological balance.

Read Story

Ready to try Energy Quantified?

No payment or credit card required.
Would you rather like a personal demo? Book a demo