What is Bitcoin Correlated With? [PART 1/2]

in #bitcoin8 years ago (edited)

BTC.png

Have you ever wondered what influences the Bitcoin price? In this article I will reveal to you the relationship between different data and Bitcoin and how it affects the Bitcoin price. Keep in mind that correlation =/= causation, but nontheless, it can be used to model the price more accurately, and eventually do more accurate forecasts with it.

Well the price could be correlated with itself, with it's lagged self, otherwise it would be hard to forecast it accurately. If the price is correlated with it's lagged self, that implies seasonality, and a repeating pattern in the price, so forecasting could work this way.

However we are out of luck, the BTC_USD price is not seasonal, and it is only correlated with it's 1st lag, therefore any prediction we make on the price alone, won't reveal any information to us, and won't give us an edge over the market.

PRICE CORR.png

The ACF and PACF, according to the Box-Jenkins protocol, suggests us a:

Decay, starting after a few lags: Mixed autoregressive and moving average (ARMA) model.

So we know that the price will be an ARIMA model, so far so good. But first let's talk about correlation with other data.




Correlation

Now that we know that the price is only correlated with it's first lag, we know it's model, however this won't tell us anything useful, or anything that would give us an edge over the others at predicting the price. So I went around looking for data related to Bitcoin.

I have found many statistics at: Blockchain and at Bitcointalk.

These statistics are related very much to the price of Bitcoin, for example the Total Number Bitcoin's is effectively the Bitcoin supply, so it's obvious that it will correlate with the price. But I looked at the Number of transactions, the Exchange Trading Volume and even at the statistics on Bitcointalk, like: New Topics/Day, New Posts/Day, you name it.

It should be straight forward that the number of activity on the forum, and the statistics on Blockchain.info, should relate to the price, influence it, or even predict it.

So I have done a correlation matrix between these values:

PriceTotalBitcoinsDifficultyAvgblocksizeHashrate
1.00000.72740.55560.64810.5559Price
x1.00000.66490.84910.6650TotalBitcoins
xx1.00000.89630.9940Difficulty
xxx1.00000.8860Avgblocksize
xxxx1.0000Hashrate

And more....

PriceTotBTCWallet UsersNewTopicsNewPostsNewMembersOnlinePageviews
1.00000.65870.52760.65330.60730.60630.67510.5763Price
x1.00000.79000.90260.87130.86860.95690.8393TotBTC
xx1.00000.96280.97500.98590.62330.9886Wallet Users
xxx1.00000.99600.99130.78470.9882NewTopics
xxxx1.00000.99260.73570.9962NewPosts
xxxxx1.00000.72870.9932NewMembers
xxxxxx1.00000.6920Online
xxxxxxx1.0000Pageviews

And much more... I won't put them all here because it takes a while to format tables with Markdown, sigh... :( .... But you can now see the relationships between different data, how they affect the Bitcoin price. Now let's model the price based on these dependent variables.




ARIMAX

It is Auto Regressive Integrated Moving Average with Exogeneous Input, it's based on ARIMA but this model can handle multiple inputs instead of one. It is essential if we want to model the BTC_USD price with the other variables as regressors.

I am using the Blockchain.info price data obviously, on BTC_USD, with the data up until 22nd of October. Then the Correlogram shown earlier tells us that the maximum lag is 1. So we will have alternating ARIMA/ARMA models for each comparison. I have gone through all combinations, and will only show here the best ones.

Let's start forecasting the best datasets, by my usual methodology, forecasting 20% of the data from the rest of the 80%, in this case starting from the 2000th up until the latest.

Naked ARIMA ( 1 1 1 )

ARIMA 1 1 1 - NUDE (142.34).png

This is the naked ARIMA model for the price, without adding any additional inputs. It has an error of 142.34, therefore we expect, a better accuracy and smaller error from the regressed models. Moving on.

ARIMAX ( 1 1 1 ){ Bitcointalk Online People }

ARIMA 1 1 1 ONLINE 172.43.png

We can see that the number of people online on Bitcointalk, doesn't really help us predict the price, in fact it's even worse than our naked prediction with an error of 172.43. It is missing a lot of data therefore this variable is not that accurate.

ARIMAX ( 1 0 1 ){ Bitcointalk Pageviews }

ARIMA 1 0 1 PAGEVIEW 146.95.png

The pageview statistics is slightly better, but not that much. An error of 146.95. I guess there are many bots visiting the forum and people reloading pages, so it is not that accurate of a predictor for the price.

ARIMAX ( 1 1 1 ){ Total Bitcoins }

ARIMA 1 1 1 -TOT BTC (119.79).png

This is interesting. Despite the supply of Bitcoin having a correlation >70% with the price, it barely predicts it with an error of 119.79. I guess the market has priced it in. Although it's still a better predictor than the other variables above.

ARIMAX ( 1 0 1 ){ Bitcointalk Posts }

ARIMA 1 0 1 POSTS 111.4.png

It is getting better, the number of posts on the Bitcointalk forum, reflects the price much better, the error is only 111.4, so we are getting there! The number of posts actually reflects the number of users/buyers/sellers more, than the other statistics.

ARIMAX ( 1 0 1 ){ Bitcointalk New Members }

ARIMA 1 0 1 MEMBERS 101.66.png

Much better, the number of new members on the Bitcointalk forum has a predictive error of only 101.66, so we are very close to finding a variable that helps us predict the price. Looks like from fake bots visiting the forum, from forum spammers, to new accounts (still people could create many accounts), represents the number of bitcoin activity more and more accurately.


So by going through all variables, we are finding better and better predictors, and in the next article, I will reveal to you the best data that I have found!

TO BE CONTINUED ...


Disclaimer: The information provided on this page might be incorrect. I am not responsible if you lose money using the information on this page! This is not an investment advice, just my opinion and analysis for educational purposes.


Upvote, ReSteem & button.png

Coin Marketplace

STEEM 0.16
TRX 0.16
JST 0.030
BTC 57515.20
ETH 2438.35
USDT 1.00
SBD 2.34