{"id":14337,"date":"2024-02-20T15:30:44","date_gmt":"2024-02-20T20:30:44","guid":{"rendered":"https:\/\/jasonapollovoss.com\/web\/?p=14337"},"modified":"2025-09-05T16:26:55","modified_gmt":"2025-09-05T22:26:55","slug":"independent-validation-data-handily-beats-the-sp-500","status":"publish","type":"post","link":"https:\/\/jasonapollovoss.com\/web\/2024\/02\/20\/independent-validation-data-handily-beats-the-sp-500\/","title":{"rendered":"Independent Validation: DATA Handily Beats the S&#038;P 500"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; admin_label=&#8221;section&#8221; _builder_version=&#8221;4.16&#8243; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_row admin_label=&#8221;row&#8221; _builder_version=&#8221;4.16&#8243; background_size=&#8221;initial&#8221; background_position=&#8221;top_left&#8221; background_repeat=&#8221;repeat&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; custom_padding=&#8221;|||&#8221; global_colors_info=&#8221;{}&#8221; custom_padding__hover=&#8221;|||&#8221; theme_builder_area=&#8221;post_content&#8221;][et_pb_text admin_label=&#8221;Text&#8221; _builder_version=&#8221;4.16&#8243; background_size=&#8221;initial&#8221; background_position=&#8221;top_left&#8221; background_repeat=&#8221;repeat&#8221; global_colors_info=&#8221;{}&#8221; theme_builder_area=&#8221;post_content&#8221;]<\/p>\n<figure class=\"x-el x-el-figure c2-1 c2-2 c2-3x c2-i c2-h c2-21 c2-2c c2-29 c2-2a c2-5i c2-4v c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\">\n<div><\/div>\n<\/figure>\n<p><span style=\"font-family: futural;\">One of the most frequent requests we receive at Deception And Truth Analysis is to independently validate our claims that DATA Scores are predictive of future stock price movements.<\/span><\/p>\n<p><span style=\"font-family: futural;\">After a nearly six month review we are pleased to report that we have received that independent validation from CloudQuant[i] with the publication of their whitepaper\u00a0<a class=\"x-el x-el-a c2-2w c2-2x c2-66 c2-v c2-w c2-x c2-j c2-67 c2-3 c2-30 c2-31 c2-11 c2-32\" href=\"https:\/\/docsend.com\/view\/xukyc8uq49tgjd6z\" rel=\"\"><em class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-64\">Outperforming the Market with Measures of Deceptive and Truthful Language in Regulatory Filings<\/em><\/a>. Specifically, they found that DATA Scores provide a significant return advantage for different capitalization investment strategies, as well as in various sector strategies. In this article we will discuss CloudQuant\u2019s independent validation that shows DATA handily beats the S&amp;P 500.<\/span><\/p>\n<p><span style=\"font-family: futural;\">Now the only question, with respect, is: Are you interested in generating alpha, or not?<\/span><\/p>\n<div>\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\"><\/strong><\/span><\/h4>\n<h3 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">DATA Handily Beats the S&amp;P 500: Major Results*<\/strong><\/span><\/h3>\n<\/div>\n<p><span style=\"font-family: futural;\">CloudQuant constructed a portfolio that featured a large portion of the most truthful companies as assessed by their DATA Scores. What they found is:\u00a0<\/span><\/p>\n<p><span style=\"font-family: futural;\"><em class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-64\">[* For those interested in CloudQuant\u2019s methodology we have summarized it below.]<\/em><\/span><\/p>\n<p><span style=\"font-family: futural;\">First, CloudQuant constructed a trading strategy that took advantage of some of the findings from their study. Namely, they used the period of 2008 through 2019 to define their parameters, and with a testing period of 2020 through March 2023. Also, they used the default settings described below, but with two modifications: a) the portfolio was long-only and b) the Percentile Rank Cutoff was set to the top or bottom 35%, depending on the sector.<\/span><\/p>\n<p><span style=\"font-family: futural;\">This resulted in an Annualized Net Outperformance Return of 6.26% above the equal-weighted S&amp;P 500. The idiosyncratic return registered a cumulative 86.29% outperformance for the preceding 15 years. For this trading strategy the turnover was relatively low at just 2.2%. These results assumed trading costs of 5 basis points per trade.<\/span><\/p>\n<p><span style=\"font-family: futural;\">Here are additional details from this test as shown in the whitepaper\u2019s Figure 3.5.1:<\/span><\/p>\n<figure class=\"x-el x-el-figure c2-1 c2-2 c2-3x c2-i c2-h c2-21 c2-2c c2-29 c2-2a c2-5i c2-4v c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\">\n<div>\n<div><span style=\"font-family: futural;\"><img decoding=\"async\" class=\"x-el x-el-img c2-1 c2-2 c2-k c2-21 c2-1x c2-1y c2-29 c2-2b c2-s c2-68 c2-4f c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" title=\"CloudQuant Figure 3.5.1\" src=\"https:\/\/img1.wsimg.com\/isteam\/ip\/b4167b12-c211-4a45-9c4b-489be14138f8\/CloudQuant%20Figure%203.5.1.png\/:\/cr=t:0%25,l:0%25,w:100%25,h:100%25\/rs=w:1280\" alt=\"CloudQuant Figure 3.5.1\" \/><\/span><\/div>\n<\/div><figcaption class=\"x-el x-el-figcaption c2-1 c2-2 c2-v c2-w c2-3d c2-29 c2-2b c2-49 c2-69 c2-6a c2-6b c2-6c c2-3 c2-6d c2-3e c2-10 c2-3f c2-3g c2-3h c2-3i\"><span style=\"font-family: futural;\">CloudQuant Figure 3.5.1<\/span><\/figcaption><\/figure>\n<p><span style=\"font-family: futural;\">Second, for the highest performing sectors there is outperformance for at least 80 days after a position is initiated.<\/span><\/p>\n<p><span style=\"font-family: futural;\">Third, for most of the sectors, the market does not begin to price the DATA Score assessment until 30 days after the publication of 10-Ks and 10-Qs (i.e. the day that DATA assesses these documents and publishes its DATA Scores). DATA speculates that this is because most investors respond to and price the quantitative financial results such as EPS rather than deceptiveness or truthfulness. These outcomes are driven by the management of a company\u2019s linguistic choices; which, in turn, are driven by their behaviors. DATA very specifically is designed to measure deceptive and truthful behaviors. Thus, a lagged response makes sense.<\/span><\/p>\n<p><span style=\"font-family: futural;\">Finally, because CloudQuant found that holding period is an important parameter they varied both the Lookback Period and the Holding Period and found that there is a consistent increase in Sharpe Ratio as the Lookback Period increases from 30 to 70 days. This means that DATA Scores are a useful metric throughout an entire quarter with peak performance at 20 and 80 Holding Period Days with 70 Lookback Period Days. To quote CloudQuant, they say:<\/span><\/p>\n<hr class=\"x-el x-el-hr c2-1 c2-2 c2-6g c2-6h c2-4k c2-29 c2-2b c2-k c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" \/>\n<p><span style=\"font-family: futural;\">\u201c[This is] a very interesting result. Typically, datasets used for investing will exhibit a very rapid (minutes to several days) decay in their returns resulting in very limited ability for large investment managers to utilize them in a significant way.\u201d<\/span><\/p>\n<hr class=\"x-el x-el-hr c2-1 c2-2 c2-6g c2-6h c2-4k c2-29 c2-2b c2-k c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" \/>\n<div>\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\"><\/strong><\/span><\/h4>\n<h3 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">DATA Handily Beats the S&amp;P 500: Methodology<\/strong><\/span><\/h3>\n<\/div>\n<ol>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data assessed<\/u>. CloudQuant assessed DATA\u2019s U.S. SEC 10-Q and 10-K filings dataset, DATAbase, as well as the Managements\u2019 Discussion &amp; Analysis sections of these documents.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data quantity<\/u>. In total there were 6.7 million data points from 213,522 earnings filings from 6,191 companies over 14 years.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Biases assessed<\/u>. CloudQuant tested for both LookAhead Bias and Survivorship Bias and found both to be low.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data examined<\/u>. CloudQuant evaluated three primary outputs from DATA, our DATA Score, the average DATA Score for deceptive fragments, and the average DATA Score for Truthful fragments. Very importantly, DATA\u2019s algorithm has never seen a stock price. Instead, DATA\u2019s assessments are an NLP assessment of known-to-science behavioral differences between deceivers and truth tellers.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Transaction fees<\/u>. For each entry or exit trade they assumed a 5-basis point transaction fee.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Composition<\/u>. Their index composition was not static and was reassessed and redefined at the commencement of each calendar year to ensure that the universe of stock symbols remained representative of their respective market capitalization categories. This means that the strategies adapt to Mergers and Acquisition activity, ascendant companies, and company size dynamics.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Rebalancing<\/u>. Two methods of rebalancing were used: a) constant holding period with event-based entry; and b) daily rebalancing with a fixed lookback window and a maximum holding period.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Position Entry<\/u>. Multiple lookback periods were considered, as well looking at the percentile ranking of companies based on their DATA Scores. Once DATA Scores were converted into percentiles, CloudQuant also established threshold values for long and short positions.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Position Exit<\/u>. Two methods of exiting positions were examined; a) daily re-balancing with a fixed lookback window; and event based entry with a constant holding period.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Weighting<\/u>. Two methods of weighting portfolios was considered; a) equal weight, and b) an event-driven approach, where the sizing is adjusted based on the signal count for each entry date.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Entry Trade Execution Lag<\/u>. To account for the fact it can take a long time to fully purchase a given position, CloudQuant looked at execution times ranging from 0 to up to 50 trading days.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Holding Period<\/u>. Holding periods were varied from between 20 to 80 days. This allowed CloudQuant to evaluate the decay rate of the signals, potential investment capacity and performance characteristics for medium to long-term investments.<\/span><\/li>\n<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Structure<\/u>. Two types of portfolio structure were considered; a) long-only, and b) dollar-neutral.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-family: futural;\">For its back tests conducted on the S&amp;P 500, CloudQuant\u2019s default parameters were as follows:<\/span><\/p>\n<ol>\n<li><span style=\"font-family: futural;\">Portfolio Structure: dollar neutral.<\/span><\/li>\n<li><span style=\"font-family: futural;\">Lookback Period: 60 days.<\/span><\/li>\n<li><span style=\"font-family: futural;\">Cross-Sectional Percentile Ranking of raw DATA Score.<\/span><\/li>\n<li><span style=\"font-family: futural;\">Entry Trade Execution Lag: 0 days.<\/span><\/li>\n<li><span style=\"font-family: futural;\">Holding Period: 40 days.<\/span><\/li>\n<li><span style=\"font-family: futural;\">Percentile Rank Cutoff Threshold: 0.50\/0.50<\/span><\/li>\n<li><span style=\"font-family: futural;\">Portfolio Weights: equal-weighted and the date range is 2008 through 2022.\u00a0<\/span><\/li>\n<\/ol>\n<p><span style=\"font-family: futural;\">For a full list of parameters, please refer to the whitepaper, Section 2, pages 2-9.<\/span><\/p>\n<div>\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\"><\/strong><\/span><\/h4>\n<h3 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">What\u2019s Up Next?<\/strong><\/span><\/h3>\n<\/div>\n<p><span style=\"font-family: futural;\">This is the first of six articles we are authoring to summarize CloudQuant\u2019s and Solactive\u2019s independent validation of DATA Scores\u2019 ability to predict future stock price movements. Here is an overview of this and forthcoming articles:<\/span><\/p>\n<ol>\n<li><span style=\"font-family: futural;\">DATA Handily Beats the S&amp;P 500 \u2013 A summary of CloudQuant\u2019s large cap findings.<\/span><\/li>\n<li><span style=\"font-family: futural;\">DATA\u2019s Stock Picking Batting Average \u2013 Solactive found that DATA Scores make the right judgment call on stock\u2019s going up a very large amount of the time.<\/span><\/li>\n<li><span style=\"font-family: futural;\">DATA Handily Beats the Russell 2000 \u2013 A summary of CloudQuant\u2019s small cap findings.<\/span><\/li>\n<li><span style=\"font-family: futural;\">DATA Measures Something Unique \u2013 A summary of CloudQuant\u2019s latency findings that demonstrate that it takes markets many weeks to price the behaviors that DATA is measuring.<\/span><\/li>\n<li><span style=\"font-family: futural;\">DATA\u2019s Industry Performance \u2013 A summary of CloudQuant\u2019s industry findings. The report card indicates that DATA is excellent in picking some industries, less so in others.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-family: futural;\">\u00a0\u00a0\u00a0\u00a0<\/span><\/p>\n<p><span style=\"font-family: futural;\">[i] Solactive has also independently validated the DATA platform with interesting results that will feature in a forthcoming article.<\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>One of the most frequent requests we receive at Deception And Truth Analysis is to independently validate our claims that DATA Scores are predictive of future stock price movements. After a nearly six month review we are pleased to report that we have received that independent validation from CloudQuant[i] with the publication of their whitepaper\u00a0Outperforming [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":14338,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"<figure class=\"x-el x-el-figure c2-1 c2-2 c2-3x c2-i c2-h c2-21 c2-2c c2-29 c2-2a c2-5i c2-4v c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\">\r\n<div>\r\n<div><span style=\"font-family: futural;\"><img class=\"x-el x-el-img c2-1 c2-2 c2-k c2-21 c2-1x c2-1y c2-29 c2-2b c2-s c2-68 c2-4f c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" title=\"DATA Handily Beats the S&amp;P 500\" src=\"https:\/\/img1.wsimg.com\/isteam\/ip\/b4167b12-c211-4a45-9c4b-489be14138f8\/CloudQuant%20Figure%203.5.1.png\/:\/cr=t:0%25,l:0%25,w:100%25,h:100%25\/rs=w:1280\" alt=\"DATA Handily Beats the S&amp;P 500\" \/><\/span><\/div>\r\n<\/div>\r\n<figcaption class=\"x-el x-el-figcaption c2-1 c2-2 c2-v c2-w c2-3d c2-29 c2-2b c2-49 c2-69 c2-6a c2-6b c2-6c c2-3 c2-6d c2-3e c2-10 c2-3f c2-3g c2-3h c2-3i\"><span style=\"font-family: futural;\">DATA Handily Beats the S&amp;P 500<\/span><\/figcaption><\/figure>\r\n<em><span style=\"font-family: futural;\">By Jason A. Voss, CFA<\/span><\/em>\r\n\r\n<span style=\"font-family: futural;\">One of the most frequent requests we receive at Deception And Truth Analysis is to independently validate our claims that DATA Scores are predictive of future stock price movements.<\/span>\r\n\r\n<span style=\"font-family: futural;\">After a nearly six month review we are pleased to report that we have received that independent validation from CloudQuant[i] with the publication of their whitepaper\u00a0<a class=\"x-el x-el-a c2-2w c2-2x c2-66 c2-v c2-w c2-x c2-j c2-67 c2-3 c2-30 c2-31 c2-11 c2-32\" href=\"https:\/\/docsend.com\/view\/xukyc8uq49tgjd6z\" rel=\"\"><em class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-64\">Outperforming the Market with Measures of Deceptive and Truthful Language in Regulatory Filings<\/em><\/a>. Specifically, they found that DATA Scores provide a significant return advantage for different capitalization investment strategies, as well as in various sector strategies. In this article we will discuss CloudQuant\u2019s independent validation that shows DATA handily beats the S&amp;P 500.<\/span>\r\n\r\n<span style=\"font-family: futural;\">Now the only question, with respect, is: Are you interested in generating alpha, or not?<\/span>\r\n<div>\r\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">DATA Handily Beats the S&amp;P 500: Major Results*<\/strong><\/span><\/h4>\r\n<\/div>\r\n<span style=\"font-family: futural;\">CloudQuant constructed a portfolio that featured a large portion of the most truthful companies as assessed by their DATA Scores. What they found is:\u00a0<\/span>\r\n\r\n<span style=\"font-family: futural;\"><em class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-64\">[* For those interested in CloudQuant\u2019s methodology we have summarized it below.]<\/em><\/span>\r\n\r\n<span style=\"font-family: futural;\">First, CloudQuant constructed a trading strategy that took advantage of some of the findings from their study. Namely, they used the period of 2008 through 2019 to define their parameters, and with a testing period of 2020 through March 2023. Also, they used the default settings described below, but with two modifications: a) the portfolio was long-only and b) the Percentile Rank Cutoff was set to the top or bottom 35%, depending on the sector.<\/span>\r\n\r\n<span style=\"font-family: futural;\">This resulted in an Annualized Net Outperformance Return of 6.26% above the equal-weighted S&amp;P 500. The idiosyncratic return registered a cumulative 86.29% outperformance for the preceding 15 years. For this trading strategy the turnover was relatively low at just 2.2%. These results assumed trading costs of 5 basis points per trade.<\/span>\r\n\r\n<span style=\"font-family: futural;\">Here are additional details from this test as shown in the whitepaper\u2019s Figure 3.5.1:<\/span>\r\n<figure class=\"x-el x-el-figure c2-1 c2-2 c2-3x c2-i c2-h c2-21 c2-2c c2-29 c2-2a c2-5i c2-4v c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\">\r\n<div>\r\n<div><span style=\"font-family: futural;\"><img class=\"x-el x-el-img c2-1 c2-2 c2-k c2-21 c2-1x c2-1y c2-29 c2-2b c2-s c2-68 c2-4f c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" title=\"CloudQuant Figure 3.5.1\" src=\"https:\/\/img1.wsimg.com\/isteam\/ip\/b4167b12-c211-4a45-9c4b-489be14138f8\/CloudQuant%20Figure%203.5.1.png\/:\/cr=t:0%25,l:0%25,w:100%25,h:100%25\/rs=w:1280\" alt=\"CloudQuant Figure 3.5.1\" \/><\/span><\/div>\r\n<\/div>\r\n<figcaption class=\"x-el x-el-figcaption c2-1 c2-2 c2-v c2-w c2-3d c2-29 c2-2b c2-49 c2-69 c2-6a c2-6b c2-6c c2-3 c2-6d c2-3e c2-10 c2-3f c2-3g c2-3h c2-3i\"><span style=\"font-family: futural;\">CloudQuant Figure 3.5.1<\/span><\/figcaption><\/figure>\r\n<span style=\"font-family: futural;\">Second, for the highest performing sectors there is outperformance for at least 80 days after a position is initiated.<\/span>\r\n\r\n<span style=\"font-family: futural;\">Third, for most of the sectors, the market does not begin to price the DATA Score assessment until 30 days after the publication of 10-Ks and 10-Qs (i.e. the day that DATA assesses these documents and publishes its DATA Scores). DATA speculates that this is because most investors respond to and price the quantitative financial results such as EPS rather than deceptiveness or truthfulness. These outcomes are driven by the management of a company\u2019s linguistic choices; which, in turn, are driven by their behaviors. DATA very specifically is designed to measure deceptive and truthful behaviors. Thus, a lagged response makes sense.<\/span>\r\n\r\n<span style=\"font-family: futural;\">Finally, because CloudQuant found that holding period is an important parameter they varied both the Lookback Period and the Holding Period and found that there is a consistent increase in Sharpe Ratio as the Lookback Period increases from 30 to 70 days. This means that DATA Scores are a useful metric throughout an entire quarter with peak performance at 20 and 80 Holding Period Days with 70 Lookback Period Days. To quote CloudQuant, they say:<\/span>\r\n\r\n<hr class=\"x-el x-el-hr c2-1 c2-2 c2-6g c2-6h c2-4k c2-29 c2-2b c2-k c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" \/>\r\n\r\n<span style=\"font-family: futural;\">\u201c[This is] a very interesting result. Typically, datasets used for investing will exhibit a very rapid (minutes to several days) decay in their returns resulting in very limited ability for large investment managers to utilize them in a significant way.\u201d<\/span>\r\n\r\n<hr class=\"x-el x-el-hr c2-1 c2-2 c2-6g c2-6h c2-4k c2-29 c2-2b c2-k c2-3 c2-4 c2-5 c2-6 c2-7 c2-8\" \/>\r\n\r\n<div>\r\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">DATA Handily Beats the S&amp;P 500: Methodology<\/strong><\/span><\/h4>\r\n<\/div>\r\n<ol>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data assessed<\/u>. CloudQuant assessed DATA\u2019s U.S. SEC 10-Q and 10-K filings dataset, DATAbase, as well as the Managements\u2019 Discussion &amp; Analysis sections of these documents.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data quantity<\/u>. In total there were 6.7 million data points from 213,522 earnings filings from 6,191 companies over 14 years.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Biases assessed<\/u>. CloudQuant tested for both LookAhead Bias and Survivorship Bias and found both to be low.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Data examined<\/u>. CloudQuant evaluated three primary outputs from DATA, our DATA Score, the average DATA Score for deceptive fragments, and the average DATA Score for Truthful fragments. Very importantly, DATA\u2019s algorithm has never seen a stock price. Instead, DATA\u2019s assessments are an NLP assessment of known-to-science behavioral differences between deceivers and truth tellers.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Transaction fees<\/u>. For each entry or exit trade they assumed a 5-basis point transaction fee.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Composition<\/u>. Their index composition was not static and was reassessed and redefined at the commencement of each calendar year to ensure that the universe of stock symbols remained representative of their respective market capitalization categories. This means that the strategies adapt to Mergers and Acquisition activity, ascendant companies, and company size dynamics.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Rebalancing<\/u>. Two methods of rebalancing were used: a) constant holding period with event-based entry; and b) daily rebalancing with a fixed lookback window and a maximum holding period.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Position Entry<\/u>. Multiple lookback periods were considered, as well looking at the percentile ranking of companies based on their DATA Scores. Once DATA Scores were converted into percentiles, CloudQuant also established threshold values for long and short positions.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Position Exit<\/u>. Two methods of exiting positions were examined; a) daily re-balancing with a fixed lookback window; and event based entry with a constant holding period.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Weighting<\/u>. Two methods of weighting portfolios was considered; a) equal weight, and b) an event-driven approach, where the sizing is adjusted based on the signal count for each entry date.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Entry Trade Execution Lag<\/u>. To account for the fact it can take a long time to fully purchase a given position, CloudQuant looked at execution times ranging from 0 to up to 50 trading days.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Holding Period<\/u>. Holding periods were varied from between 20 to 80 days. This allowed CloudQuant to evaluate the decay rate of the signals, potential investment capacity and performance characteristics for medium to long-term investments.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\"><u class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-31 c2-63 c2-66\">Portfolio Structure<\/u>. Two types of portfolio structure were considered; a) long-only, and b) dollar-neutral.<\/span><\/li>\r\n<\/ol>\r\n<span style=\"font-family: futural;\">For its back tests conducted on the S&amp;P 500, CloudQuant\u2019s default parameters were as follows:<\/span>\r\n<ol>\r\n \t<li><span style=\"font-family: futural;\">Portfolio Structure: dollar neutral.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Lookback Period: 60 days.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Cross-Sectional Percentile Ranking of raw DATA Score.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Entry Trade Execution Lag: 0 days.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Holding Period: 40 days.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Percentile Rank Cutoff Threshold: 0.50\/0.50<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">Portfolio Weights: equal-weighted and the date range is 2008 through 2022.\u00a0<\/span><\/li>\r\n<\/ol>\r\n<span style=\"font-family: futural;\">For a full list of parameters, please refer to the whitepaper, Section 2, pages 2-9.<\/span>\r\n<div>\r\n<h4 class=\"x-el x-el-h4 c2-6e c2-6f c2-v c2-w c2-42 c2-2c c2-2a c2-29 c2-2b c2-3 c2-z c2-44 c2-10 c2-45 c2-46 c2-47 c2-48\"><span style=\"font-family: futural;\"><strong class=\"x-el x-el-span c2-2w c2-2x c2-3 c2-62 c2-13 c2-3v c2-63\">What\u2019s Up Next?<\/strong><\/span><\/h4>\r\n<\/div>\r\n<span style=\"font-family: futural;\">This is the first of six articles we are authoring to summarize CloudQuant\u2019s and Solactive\u2019s independent validation of DATA Scores\u2019 ability to predict future stock price movements. Here is an overview of this and forthcoming articles:<\/span>\r\n<ol>\r\n \t<li><span style=\"font-family: futural;\">DATA Handily Beats the S&amp;P 500 \u2013 A summary of CloudQuant\u2019s large cap findings.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">DATA\u2019s Stock Picking Batting Average \u2013 Solactive found that DATA Scores make the right judgment call on stock\u2019s going up a very large amount of the time.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">DATA Handily Beats the Russell 2000 \u2013 A summary of CloudQuant\u2019s small cap findings.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">DATA Measures Something Unique \u2013 A summary of CloudQuant\u2019s latency findings that demonstrate that it takes markets many weeks to price the behaviors that DATA is measuring.<\/span><\/li>\r\n \t<li><span style=\"font-family: futural;\">DATA\u2019s Industry Performance \u2013 A summary of CloudQuant\u2019s industry findings. The report card indicates that DATA is excellent in picking some industries, less so in others.<\/span><\/li>\r\n<\/ol>\r\n<span style=\"font-family: futural;\">\u00a0\u00a0\u00a0\u00a0<\/span>\r\n\r\n<span style=\"font-family: futural;\">[i] Solactive has also independently validated the DATA platform with interesting results that will feature in a forthcoming article.<\/span>","_et_gb_content_width":"","footnotes":""},"categories":[3,465],"tags":[441],"class_list":["post-14337","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-the-blog","category-d-a-t-a","tag-validation"],"_links":{"self":[{"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/posts\/14337","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/comments?post=14337"}],"version-history":[{"count":0,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/posts\/14337\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/media\/14338"}],"wp:attachment":[{"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/media?parent=14337"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/categories?post=14337"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jasonapollovoss.com\/web\/wp-json\/wp\/v2\/tags?post=14337"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}