Essay Doctorate 1,701 words

Trading Strategy of Cryptocurrencies

Last reviewed: September 18, 2018 ~9 min read

Clustering Cryptocurrencies
1. Introduction
Why is clustering interesting? How to value cryptocurrencies has been a major question ever since so many began finding their way to market. As Qunitero (2018) points out, “having a clear and unbiased benchmark while evaluating new decentralized projects in the crypto economy” could help to answer the question of valuation. Clustering commonly occurs around token type: thus, one routinely sees the clustering of currency tokens, platform tokens, utility tokens, brand tokens, and security tokens. Yet these are not the only clusters that may appear, the more closely one looks at the space. As clustering shows which cryptocurrencies move in tandem at the top of the market cap, it is useful to examine clustering cryptocurrencies to see what similarities in movement might tell us.
Are fundamental similarities backed by market metrics? That is the main question to be asked and an important one because clusters can be used to formulate trading strategies. However, Qunitero (2018) notes that there is more than one cluster in the cryptocurrency space—in fact, there are numerous ones. Identifying them and understanding the relationship among assets is critical to devising a successful trading strategy. Identifying clusters as part of developing a trading strategy for cryptocurrencies could help make the space far more viable for investors and speculators alike. “There do seem to exist natural clusters of coins that move in tandem,” Quintero (2018) states—which means more cryptocurrency samples need to be examined in order to clarify the seeming relationships.
2. Method and Results
Part I: Developing a Method
The problem of time series clustering can be considered as finding a function:
$$f(X_T) = y \\\\in [1...K]$$$$\\\\text{for }X_T=(x_1, ..., x_T)$$$$\\\\text{with }x_T \\\\in\\\\mathbb{R^d}$$
where T is timeline length and K is particular cluster. This should be conducted with representation of time series as a set of selected features vi of fixed size D independent of T.
With this representation, applying standard clustering algorithms on this feature set can be possible. The main question is what features to consider when applying the algorithm? For the purpose of this study, we identified multiple time series describing each coin and we also constructed derivative parameters to define these series.
Next, we devised a method of moving from simple to complex in terms of identifying clusters:
1. We used common, standard features for each series (parameter): Means, Medians, Standard deviations, Skewness, and Kurtosis.
2. We used tsfresh library to automate the process of features extraction.
3. We applied both approaches to series fragmented by state of BTC.
DBSCAN
It was important to identify a clustering method that could be applied quickly to facilitate trading and allow easy scaling. The clustering method selected, therefore, was DBSCAN, one of the most universal and applicable algorithms available today. The DBSCAN algorithm views clusters as areas of high density separated by areas of low density. Due to this simple if generic function, clusters found by DBSCAN can take any shape, as opposed to the k-means method, which assumes that clusters are convex shaped. As the purpose of this study was to identify clusters without applying presupposed views of what they should look like, the k-means method was inappropriate and DBSCAN, with its basic approach to recognizing clusters, fit much more effectively. This is why the data obtained in this study is quite rarified.
The central component to the DBSCAN is the concept of core samples, which are samples that occur in areas of high density. A cluster is therefore a set of core samples, each close to one another (measured by a distance measure) and a set of non-core samples that are close to a core sample (but are not themselves core samples). There are two parameters to the algorithm, min_samples and eps, which define formally what is meant by density. Higher min_samples or lower eps indicate higher density necessary to form a cluster.
The additional advantage of DBSCAN is the calculation of an estimated number of clusters that it permits. Using DBSCAN, top-level clusters could be obtained using data presence across all given coins. The Extractor function of basic features was applied to each large cluster identified. Clustering then became possible inside top-level clusters following the application of this function.
This quite basic method of features extraction for the development of coin profiles can be scaled: for example, features can be extracted for different periods of time, forming wider sets of features for each coin. Alternatively, a measure of similarity can be found for different periods and compiled in unified metrics across all periods. In short, clustering can be conducted across multiple variables as inputs. The next step in the process was to perform clustering relying on extracted features and additionally to use tsfresh library as an alternative approach.
Part II: Clustering Inside Top-Level Clusters
To accomplish clustering inside top-level clusters, which were identified using DBSCAN, Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) was used to perform the DBSCAN over varying epsilon values and to integrate the results to find a clustering that gives the best stability over epsilon. HDBSCAN allowed us to find clusters of varying densities (unlike DBSCAN), and be more robust in terms of parameter selection.
The limitations of this approach are that it is highly general. It does not detect mutual dependencies, following trends or other complex arrangement. Essentially, it is clustering by basic time series characteristics. Top-level clustering is a rather necessary measure and the list of features describing each time series is short, but it can easily be extended with additional features.
HDBSCAN (as an extension of DBSCAN method) groups together points that are closely packed together (points with many nearby neighbors), marking as outliers points that lie alone in low-density regions (whose nearest neighbors are too far away). This allows “unclustered coins” to appear: they are outliers categorized under the -1 label. This group consists of two types of coins: coins with a large amount of data that is not collected and actual outliers. Each coin presented in this group should be considered as special, including the crypto-headliners which appeared there. By many different sets of parameters, these headliners are actual outliers.
Part III: Features Extraction and Clustering
A new instance of Extractor class can be initiated by passing a periods_of_interest argument, which will slice the scope of coin by time axis with respect to periods of interest. This parameter is helpful with bull_bear_periods function execution. To ascertain market states, it is necessary to obtain periods of different market states in accordance with bullish, bearish or stable BTC trends. To perform clustering with the basic features extractor, it was first necessary to perform top-level clustering that would separate coins to groups in correspondence with the presence of data.
The result revealed three separate market states with corresponding time periods, which meant it was necessary to create three sets in accordance with states and perform clustering for each of them. HDBSCAN was used to obtain the clusters, and clustering in bearish periods appeared. The combination of using algorithm and passed parameters indeed allowed us to identify one brief bearish period (early 2018 following the steep fall in BTC). However, the aim is to make bull_bear_periods less rigorous. Clustering also appeared in bullish periods, with more clustering appearing in bullish periods. In stable periods (sideways trading), clustering also appeared. This was immensely interesting for the following reasons:
1. Quite a few stable periods were detected on the BTC timeline, making this part of the sample by far the most representative.
2. In contrast to bullish and bearish BTC states, the stable or sideways trading state can be postulated intuitively. The terms bullish and bearish are blurred and there no strong definitions of them, because they are not really understood as such until years later, looking back—but everyone can see when BTC is in a flat or sideways trading phase. In this context, the stable trading market offers the most predictive value.
Part IV: Features Extraction and Clustering
Comparing clusters obtained via various methods can be illuminative and offer insights into possible improvements of created pipeline and further research. There are several reasons why comparing clusters obtained by different methods (i.e., basic clustering and tsfresh clustering) is unsuitable. The main reasons are:
1. The metrics used by each approach are different.
2. There were no top-level clusters in the case of tsfresh.
In the case of tsfresh-based clustering we faced a problem of high dimensionality that did not permit a view of coins as points in space. Nonetheless we could create a simplified visualization of clusters as a graph, which consisted of connected subgraphs. Each subgraph represented corresponding clusters with all nodes mutually connected. These visualizations also revealed important findings:
· All coins inside clusters are connected a priori between each other. This means that special attention should be given to coins lying on the border of dense regions and their connections.
· Graphs do not take into account real distance between coins, but only illustrate which coins are connected, though not how they are connected.
· While one particular method may consider a coin an outlier, another method can find it in a cluster with others at the same time. This will lead to the presence of corresponding nodes on the graph.
3. Conclusion
Fundamentals cannot explain crypto price dynamics. Traditional fundamental clustering does not explain price movement. Therefore, we must reject the hypothesis that “Because they are currency coins we consider them to behave similarly” and we must reject trading based on fundamental clusters. As the study has shown the method used to obtain clusters will lead to significant deviations. The trading implications of this study are that clustering methods can be incorporated into an actual trading strategy, but more information is needed to determine the appropriate method. Market sluggishness and sensitivity could enable more prolonged measures to be conducted. Using HDBSCAN to study metrics and distances should also be explored.

References
Quintero, S. (2018). Clustering cryptocurrencies with affinity propagation. Retrieved
from https://towardsdatascience.com/clustering-cryptocurrencies-with-affinity-propagation-b0ca9e105bb6

You’re 100% through this paper. Sign up to read the full paper.

130,000+ paper examples AI writing assistant Citation generator Cancel anytime

Cite This Paper

PaperDue. (2018). Trading Strategy of Cryptocurrencies. PaperDue. https://www.paperdue.com/essay/trading-strategy-of-cryptocurrencies-essay-2172333

Always verify citation format against your institution’s current style guide requirements.