Fifteen Seconds of Fame: TikTok and the Supply Side of Social Video PDF

COMPUTATIONAL COMMUNICATION RESEARCH 4.2 (2022) 463-485 https://doi.org/10.5117/CCR2022.2.004.GUIN Fifteen Seconds of Fame: TikTok and the Supply Side of Social Video Benjamin Guinaudeau University of Konstanz Kevin Munger Pennsylvania State University Fabio Votta University of Amsterdam Abstract TikTok has rapidly developed from a punchline for jokes about “kids these days” into a formidable force in American politics. The speed of this development is unprecedented, even in the rapidly-changing world of digital politics. Through a combination of hashtag and snowball sampling, we identify 11,546 TikTok accounts who primarily post about politics, allowing us to analyze trends in the posting, viewing and commenting behavior on 1,998,642 tiktoks they have uploaded. We test a number of theories about how the unique combination of affordances on TikTok shapes how it is used for political communication. Compared to the dominant platform for political videos (YouTube) we f ind that a higher percentage of TikTok users upload videos, TikTok view counts are more dominated by virality, and viewership of videos are less dependent on a given accounts’ number of followers/subscribers. We discuss how these findings affect the production of content that ultimately determines the experience of TikTok consumers. Keywords: TikTok, social media, political communication, content analysis © Benjamin Guinaudeau, Kevin Munger & Fabio Votta This is an open access article distributed under the terms of the CC BY 4.0 license http://creativecommons.org/licenses/by/4.0 463 Computational Communication Research Fifteen Seconds of Fame: TikTok and the Supply Side of Social Video Introduction to Politics on TikTok TikTok has rapidly developed from a punchline for jokes about “kids these days” into a formidable force in American politics. The speed of this development is unprecedented, even in the rapidly changing world of digital politics (Karpf, 2012). The TikTok app was released in the United States in August of 2018; it was the most downloaded App across all application stores in 20201, beating out social media heavyweights like Facebook, Instagram and YouTube. The company’s 2020 Transparency Report indicates that 104 million videos were removed from the platform worldwide and that this was less than one percent of all videos—putting a floor on the number of videos uploaded in this six-month period at over 10 billion (TikTok, 2019). This is more than double the same figure from the second half of 2019, indicating continuing blistering growth (TikTok, 2020). In 2020, the New York Times ran multiple stories about the political influence of the platform on American politics. “The Political Pundits of TikTok” (Feb 27) details the emergence of sometimes partisan-affiliated “Hype Houses” that produce political content, and included the quote that “TikTok is Cable News for Young People.” “TikTok Users, K-Pop Fans Say They Helped Sabotage Trump Rally With False Registrations” (June 21) explained the disappointing size of the audience of President Trump’s first post-pandemic-onset rally as the result of a campaign of coordinated inauthentic behaviour by political TikTok users. This paper presents a large-scale quantitative descriptive analysis of TikTok Politics. Building on Serrano et al., 2020’s analysis of 3,310 political tiktoks sampled from two hashtags and Literat and Kligler-Vilenchik, 2019’s analysis of 1,651 political tikoks sampled from two other hashtags, our analysis includes 1,998,642 tiktoks from the 11,546 accounts we have encountered and scraped as of October, 6th 2020.2 The growth of this ecosystem has been vertiginous. At the end of 2019, the accounts we analyze had uploaded 206,661 total tiktoks; as of October 2020, those accounts have uploaded almost 2 million tiktoks (1,998,642). The growth in viewership numbers is even starker. The one-billionth “play” (TikTok’s term for what YouTube calls a “view”) for these accounts occurred in September 2019; as of October 2020, their tiktoks have been viewed 25.11 billion times. 464 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video These numbers are outlandishly large even compared to more well-known online video platforms. The videos uploaded by the thirty-three channels comprising the well-established “YouTube Right” described in Munger and Phillips, 2020 received only 2.9 billion views over the years 2016 and 2017, their period of peak popularity. But the fact that these tiktoks are radically shorter than YouTube videos means that these “views” are not created equal; as a comparison, consider the dozens of tweets a user might “view” in the course of an hour spent “scrolling the feed” (Settle, 2018). To explain this difference, we reference research on other social media platforms to theorize about what makes TikTok distinct. TikTok represents the synthesis of three of the most powerful affordances in social media: the televisual medium that has always been the most broadly popular and powerful; algorithmic recommendation that structures the user’s experience to a greater extent than any major social media platform to date; and a mobile-only interface designed to take advantage of a smartphone’s user-facing camera. This article presents a theoretical lens for understanding the effects of those affordances on the production of video content for social media platforms, drawing on knowledge created through the study of other political media. To quantitatively test our theories, we use an analogous sample of political YouTubers as a reference set. Compared to the political YouTubers, we find that a higher percentage of TikTok users upload videos, TikTok view counts are more dominated by virality, and viewership of videos are less dependent on a given accounts’ number of followers/subscribers.3 In combination, these findings support our argument that a primary difference between TikTok and earlier social media platforms is that the affordances of the former shift the incentives and experience of content producers. A high percentage of TikTok users are creating videos for an audience of strangers in the hope of going viral. Although there are admittedly important novelties of TikTok from the perspective of the audience, we advocate that scholars pay equal attention to the supply-side factors that determine who makes (and how they make) the content that flows throughout the platform. What is TikTok? TikTok is a social media platform targeted at young, mobile-first users. Chinese company ByteDance owns both TikTok and its China-only cousin Douyin, which was founded in September 2016. TikTok was launched a year Guinaudeau, Munger & Votta 465 Computational Communication Research later, and kickstarted its growth in the US by acquiring and merging with lip-synching app musical.ly in late 2017. TikTok was the most downloaded app in the US in 2019, and second in the world to WhatsApp. Each tiktok is a 3 to 60 second-long video that loops when finished.4 The majority of the screen is taken up by the video uploaded by the user. The app offers an extremely wide range of options for customizing these videos, including: video taken with the user’s smartphone; photos uploaded from the web; emojis and other text superimposed on the video; and a library of filters and videodistorting effects. Other users can leave comments on each tiktok, including comment threads which the creator can choose to endorse. The bottom of the screen contains information about the “sound” the tiktok uses, which can either be user-uploaded or chosen from a library of popular sounds. Upon opening the app, the user encounters a tiktok that starts playing; this is the “For You Page,” which plays tiktoks that TikTok’s algorithm recommends for that user. To go to the next tiktok, the user swipes up. To see the account which uploaded the current tiktok, swipe right. The user’s profile is spare, with a brief bio and the catalogue of that user’s previously uploaded tiktoks. The metrics for the account include well-known follower and following numbers, but introduce a new metric that reflects the relative unimportance of “following” on TikTok: the total number of “likes” that user has received across all of their tiktoks. The presence of this metric also discourages users from deleting their old tiktoks, as is now common practice on Twitter and Instagram, because deleting old tiktoks would result in fewer total “likes” and thus harm the appearance of the overall popularity of the account. Who Uses TikTok? The rapid expansion of TikTok means that there is limited information about the platform’s user base. The best hard data comes from the company’s August 24, 2020 lawsuit filed against the Trump administration. In January 2018, TikTok had approximately 11 million Monthly Active Users (also called MAU, a standard metric for social media platforms). By June 2020, that number was 92 million MAUs.5 However, many of these users are unlikely to be politically active; according to internal company data reviewed by the New York Times, approximately a third of their US userbase is under fifteen years old and thus unable to vote and less likely to be engaged in electoral politics. Among adult users, there is a similar trend of the over-representation of young people. 466 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video Affordances of TikTok Social scientists have accumulated a wealth of knowledge about political communication on social media. We apply this knowledge to understand TikTok not as an entirely novel platform but rather as an continuation of earlier developments in social media. TikTok represents the synthesis of three powerful trends in social media: the televisual medium that has always been the most broadly popular and powerful; algorithmic recommendation that structures the user’s experience to a greater extent than any major social media platform to date; and a mobile-only interface designed to take advantage of a smartphone’s user-facing camera. The primary format for the political tiktoks we describe is the video-blog or vlog; often, the creator’s bedroom is visible and they look into the camera and either dance or emote in combination with music or superimposed textual images.6 This represents an extension of the credibility-via-relatability described by Lewis, 2020, Abidin, 2018 and other theorists of “influencers” or “microcelebrities.” The point is for the creator to communicate a “mood” or “vibe” that signals to the audience that they should take the creator seriously. This verbal description is an intrinsically inefficient way to communicate the experience of watching a tikok. This is not a coincidence: we argue that the information density of a single tiktok makes the medium difficult to describe in words. Although we attempt to be as explicit as possible in describing the affordances of TikTok in the following section, we believe that the reader is best served by spending half an hour watching political tiktoks to develop their intuitions. For all of our hypotheses, we require a comparison to another platform. There are arguments for any of several major platforms to serve as comparisons, and ultimately we advocate for comparisons across all of them. In the current paper, we use YouTube videos. The necessary metadata is tractable to collect, and the fact that YouTube and TikTok share the crucial televisual medium allows us to hold that affordance constant while varying the rest. Still, the platforms differ in many ways outside the scope of this analysis, and despite our best efforts, the sampling frames we use for each platform are not identical. Our empirical tests are thus not intended as the final word on TikTok, but the first step towards a necessary cross-platform analysis. In particular, there is a glaring absence of data from Instagram, which would be a useful recombination of many of the affordances of interest; however, it remains difficult to scrape data from Instagram using the sampling frames we employ. Guinaudeau, Munger & Votta 467 Computational Communication Research Ease of Posting, Algorithmic Rewards Algorithmic recommendation is perhaps more central to the user’s experience of TikTok than any other platform. Like YouTube (and unlike the primary functionality of Facebook and Twitter), the recommendation algorithm on TikTok can draw from the entire universe of tiktoks, not only ones created or shared by an account the user has “followed” or “liked.” One feature of any social media feed in environments of content abundance is the necessity of sorting to determine the order in which pieces of content are shown to the consumer, and thus ultimately (given a fixed time period) which pieces of content are viewed at all. TikTok’s user interface centers recommendations (on the “For You Page,” described in detail below) to a far greater degree than other platforms. The centrality of recommendation renders the process by which user feedback is translated into future user consumption through TikTok’s internal architecture more opaque than on platforms that are more reliant on following behavior. On YouTube and Instagram, the primary strategy for success is to accumulate a large following, taking advantage of the extensive audience feedback (through comments) to give audiences exactly what they want. On TikTok, however, the dominant and opaque algorithmic recommendation system defies straightforward attempts by creators to adapt in response to audience comments. This opacity also makes it difficult for researchers to collect data about how the algorithm operates in terms of what kind of content is shown to whom; a major concern about algorithmic curation on YouTube, for example, is that it increases viewership of extremist content and is thus a vector for far-right radicalization (Tufekci, 2018). The present paper cannot speak to this concern. Instead, we highlight the intersection of the recommendation algorithm with another major affordance: TikTok is primarily available as a mobile phone app, explicitly optimized for the front-facing, vertical-orientation camera that feels most natural for its mobile-native target audience. This camera style enhances the user’s sense of immersion and social presence (Wang, 2020). The combination of algorithmic recommendation and mobile-first design produces what we see as the most theoretically relevant aspect of TikTok from a content production standpoint: it lowers the barriers to entry and encourages a high number of viewers to become posters. In addition to examining the affordances of the platform itself, we argue that it is important to understand the path-dependent growth strategy that TikTok employed. Early uses and users of a platform have an outsized impact on its eventual culture and userbase. 468 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video All social media networks have to solve the problem of the initial construction of a network. No one wants to post into the void, but others don’t want to create a network tie with someone who never posts. Facebook, LinkedIn and Snapchat aimed to become essential for certain forms of social life, relying on users to import their own social networks to the platform. The way these platforms are used should be understood in terms of this network seeding strategy. Twitter and Instagram relied on a similar strategy initially, but then developed the hashtag as a way for users with similar interests to f ind each other and create follower networks (Thorson et al., 2016). YouTube is distinct in that it has a tiny ratio of media producers to consumers, allowing the platform to create specialized affordances for producers that solve the network construction problem through direct incentives (Caplan & Gillespie, 2020). TikTok tries to short-circuit this process by guaranteeing an audience for every post. When the user first downloads and opens the app, a tiktok starts playing immediately. The default feed is the “For You Page,” which will continue to provide new videos based on the extent to which the user engaged with previous recommendations. Part of this process involves recommending videos with extremely few views. TikTok has not provided public data on how prevalent this practice is, but the authors have all experienced it firsthand: while swiping through the “For You Page,” the recommendation algorithm occasionally displays a video with fewer than ten views. Tiktok production also includes a variety of menus with audio and visual effects that enable the user to create novel kinds of videos with minimal effort. This mimics Instagram’s strategy for kickstarting early growth: provide users with “filters” that make their photos look cooler. Each tiktok also has a “sound” (discussed in more detail below), allowing the user to participate in popular meme formats. Our prediction, then, is that more TikTok users will also be posters. That is, compared to YouTube, a higher percentage of the consumers of content will also be creators of video content. Empirically, it is difficult to pin down the denominator; we cannot measure how many people without accounts are watching these videos on each platform. We can, however, identify each account which leaves comments on other peoples’ videos. Each of these has a unique id that can matched to a user profile (or channel). We thus operationalize our hypothesis that TikTok is more effective at encouraging users to become video content creators by calculating the percentage of commenters on each platform who have also uploaded video content. Guinaudeau, Munger & Votta 469 Computational Communication Research H1: Among accounts that leave comments, the percentage who also upload videos will be larger on TikTok than YouTube The centrality of the algorithm enables what we call “virality-from-nowhere,” increasing the likelihood that a producer can have a tiktok seen by millions of people without first cultivating an audience. The algorithm does this by disrupting one of the most fundamental laws of political media: audiences have always been stocks, not flows. The accumulated habits and preferences of media consumers can be changed only gradually, in the aggregate, granting media organizations a degree of stability. Media business models based on formal recurring payments (subscriptions) are more stable still, but both in the context of the blogosphere (Hindman, 2008) and for web traffic more broadly (Hindman, 2018), less formal “bookmarking” or browser history-based recommendations tend towards audience stability. The web offers unfathomable consumer choice, counter-intuitively heightening our dependence on heuristics and habits. Social networks based on “following” other entities (which are then algorithmically sorted according to the accounts we interact with most often) are analogous to the browser-based reification of user habit that Hindman describes. The “For You Page” supplants “following” behavior entirely. Tiktoks simply appear on the screen, granting the platform incredible power in determining the fate of a given tiktok, whether it goes viral or “flops.” The fickleness of virality as a strategy for consistently reaching audiences in contexts with algorithmic recommendation is well-established. Early web-native media companies like Upworthy relied on viral Facebook posts to distribute their articles. Their strategy was to optimize for shareability, relying on audience engagement to produce manual re-shares of their content and to give the NewsFeed ranking algorithm signals (“likes”) that it had been trained to associate with organic audience engagement. Upworthy then saw their readership decimated by Facebook’s algorithm changes in 2014 (Munger, 2020). Facebook’s network-based model could only change the “rank” a given post would appear in the user’s NewsFeed, but TikTok can go farther: at every point in every user’s “For You Page,” the recommendation algorithm can choose from any of the trillions of tiktoks on their servers. We still don’t know much about how users interact with the app—how many people use it without following anyone, looking only at the purely algorithmic “For You Page,” and how many people use the more traditional “Accounts you Follow” option. One article, citing a private presentation given 470 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video by TikTok, claimed that 69% of the time users spend on the platform is on the “For You Page,” making it the default consumption choice (Stokel-Walker, 2020). Many of the users of the platform are keenly aware of the metrics of their popularity, and pay close attention to how each of their videos performs. This is an equalizing force for new or unpopular accounts: even without cultivating any following whatsoever, every tiktok is seen by someone. If these handful of initial viewers engage with it at all (a lower bar than retweeting on Twitter, the only other platform where this virality-from-nowhere is common), it is shown to more people. TikTok thus bundles content production and distribution more tightly than any other non-textual platform. H2: The relationship between followers and average video views is weaker for TikTok than YouTube. The combination of the complete opacity of the algorithm and the ease of posting means that there is a broad range of tiktoks that might appear while scrolling the “For You Page.” Many of these videos are similar to each other, mere iterations of the latest trend. Unlike retweets or social endorsements like play counts (already a fickle mapping from quality to success given the pathdependent nature of audience bandwagon effects (Salganik et al., 2006)), the passive nature of engagement on TikTok gives the app unprecedented discretion over the ultimate popularity of many roughly similar tiktoks. From the producers’ perspective, every tiktok has a chance to go viral—if the algorithm’s opaque transformation of initial audience feedback results in that tiktok showing up on a wide number of “For You Pages.” There thus exists something of a cargo cult of “the algorithm” as producers attempt to reverse engineer it. It is commonplace for tiktoks to be captioned that the user has been “shadowbanned” (their content is not being shown to others), and the phrase “don’t let this flop” evinces the anxiety and desire for viewership that accompanies each upload.7 The importance of the algorithm can be estimated from the variance in the viewership numbers for tiktoks created by a single user. Consider physical newspapers delivered to mailboxes and doorsteps; the readership of a given newspaper varies only slightly week-to-week. In contrast, we expect that the viewership of a given tiktok producer will vary heavily across even short time periods. Specifically, we predict that the inequality for TikTok views will be larger than for YouTube. We operationalize this concept in two ways. First, we calculate the inequality in viewership across the entire distribution of Guinaudeau, Munger & Votta 471 Computational Communication Research videos for a given account using the Gini Coefficient. Second, we calculate the ratio of the number of views on an account’s most-viewed video to the number of views on that account’s median video. H3: The within-account inequality of viewership for videos will be higher on TikTok than on YouTube. Data TikTok API Unlike social media platforms like Twitter and YouTube, TikTok does not provide an official API to share data on TikTok users and their behavior. However, their mobile application uses an internal API to retrieve data when in use. To access this private API, queries simulating browsing traffic can retrieve any information that is available to a normal user: video content, video descriptions, audio files, comments and engagement numbers such as the number of likes and views. This API offers several different endpoints, but we focus here on three endpoints that we used for our sampling: user, comment and hashtag endpoints. Given a user id, the user endpoint provides data on each tiktok produced by this user. This user history as accessed through the user endpoint is a complete snapshot of the account. The only tiktoks not returned by the API are those removed by the user or the platform. Analogously, the comment endpoint function returns all comments left on a certain tiktok given a tiktok id. These user and comment endpoints can be combined to deploy a snowball sampling procedure that is made credible by the fact that they each return the entire population (either all tiktoks produced by an account or all comments related to a tiktok). In contrast, the hashtag endpoint returns up to 2,000 tiktoks using a given hashtag but lacks any information on sampling criteria. For instance, if the account therepublicanhypehouse produced 10,000 tiktoks using the hashtag #DonaldTrump, the user endpoint could be queried for therepublicanhypehouse to retrieve all 10,000 tiktoks. The hashtag endpoint, when queried for #DonaldTrump would return at most a sample of 2,000 tiktoks, which are not randomly selected. In addition to the absence of sample bias, the user and comment endpoints represent efficient methods to quickly obtain large amount of data.8 Snowball Sampling The objective of this paper is to understand how the affordances of TikTok structure the dynamics of political discussion on the platform: what proportion 472 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video of users produce political videos, how densely attention is concentrated across those videos, and how important follower networks are for the structure of that attention. In the absence of a comprehensive list of all political TikTok accounts (and acknowledging that such a list is likely impossible), we use snowball sampling to iteratively discover new political accounts. We adopt a broad definition of political content, which includes both normative stances on society (“wealth redistribution reduces inequality”, “abortion is a crime” or “LTGBTQ+ rights must be protected”) as well as comments on daily politics (with topics such as the “2020 US election”, “#BlackLivesMatter” (BLM) or “policies mitigating the COVID-19 pandemic”).9 We define a TikTok account as political if a majority of the produced content by the account is likely to be political (i.e. above 50%). Any threshold here is of course somewhat arbitrary but we chose this line as indicative of accounts that reliably produce political content. For instance, following the BLM demonstrations in the summer 2020, many users produced content supporting or opposing the demonstrations during the summer and returned to their usual non-political content afterwards. While these users would certainly be worthy of study in their own right, we are interested in more constant uses of TikTok for political purposes and using a 50% threshold allows us to focus on this user type. We first identified 865 accounts focused on political topics through the use of the hashtag endpoint, searching for standard terms in 2020 US politics like “#politics”, “#MAGA”, and “#democrats.” After collecting the 272,546 tiktoks produced by this initial sample, we use two criteria to identify potentially political accounts: accounts who were either (1) frequently mentioned, either through duets or user tags in the video description; or (2) frequently left comments on videos in the initial sample. The assumption here is that accounts mentioned by political accounts or interacting a lot with political content are likely to be political users themselves. One advantage of this approach is that we capture content producers as well as content consumers “active” enough to leave a comment. Using this method, we identified more than 200,000 distinct accounts who produced more than 22 million tiktoks and can be described as at least “potentially political”. Once the “potentially political” accounts were collected, we selected the most active accounts and hand-labeled 1,134 additional accounts10. Combined with the original set of accounts, we obtained a sample of 1,999 accounts, which was big enough to train a classifier and generalise our hand-coding to the 200,000 potentially political accounts. We then classified the content they produced and eventually confirmed their focus on politics. To do so, we trained a neural network to predict whether a tiktok is political based on its description. Neural networks are Guinaudeau, Munger & Votta 473 Computational Communication Research powerful predicting tools, especially for text classification tasks (Chang & Masterson, 2020). We used a simple network with one hidden layer with 100 units trained on a document-term matrix (vocabulary 15,000). Since neural networks can efficiently model large amount of high-dimensional data, we do not need to preprocess the text (no vocabulary pruning, no stemming, no lemmatizing), which increases the amount of information provided to the model and the reproducibility of the classification. This is especially useful in cases like short-form social media, where much of the “text” is not intended to function like standard written English. As mentioned earlier, our training sample is made up by 1,999 hand-coded accounts (1,182 political and 817 non-political). In total, these accounts produced 934,226 tiktoks (406,315 political and 527,911 non-political). Since we use text-classification, we discarded videos with fewer than 3-word descriptions – 28% – and used the resulting 672,660 videos (345,856 political and 326,804 non-political) to train a neural network. 80% of the accounts were used for training, while 20% were kept aside for validation purposes. Although the prediction happens at the tiktok level, we split and validate at the measurement level (user): for each user we average the individual predictions and consider the resulting average probability of a user to be political. Users with an average probability higher than 50% were labeled as political and retained in the final dataset. The model achieved a prediction accuracy of 74% at the individual level and 83% at the user level (recall is 81% and precision is 87%). We applied the classifier on the full sample of 22 million potentially political tiktoks and identified 11,597 political accounts, which, in total, produced 2,026,506 videos. YouTube Dataset In order to understand how TikTok is used in light of the structure of the platform, we use an analogous dataset from YouTube as a benchmark. YouTube shares many affordances with TikTok. The most noticeable difference lies in the length of the videos (longer on YouTube) and the prevalence of channel subscription (unlike for TikTok, it is very common to pre-select content producers on Youtube by subscribing to their channel). So, in the space of affordances of interest for our analysis, these platforms are the most similar, and there is a wealth of academic experience collecting and modelling YouTube data from which we can draw. The YouTube dataset was constructed as part of a parallel project attempting to identify the universe of large political YouTube accounts. This effort was far less novel, and the presence of prior research allowed us to begin the process with a large collection of accounts. The final dataset combines 474 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video Figure 1. TikTok Duration the channels identified by Munger and Phillips, 2020 with an analogous snowball sampling based on the transcripts of the videos (if other accounts are mentioned) and the accounts recommended by the YouTube algorithm. The YouTube dataset consists of 1,712 political channels, which produced a mean of 700 videos each. Data was collected in May 2020 and entails both account-specific metrics (number of subscribers, number of videos, total number of views) as well as video-specific metrics (views and comments). In total, it includes just over 1,000,000 YouTube videos with a total of more than 300,000,000 comments.11 Although our sampling frames for the two platforms are broadly similar, they are not identical — and even if the process we followed were the same, the way that process interacted with each platform would be different. Therefore, the results are presented alongside bootstrapped samples for each platform, providing evidence that our empirical tests are robust to the construction of the sample. Results Description One key feature of our data that has not, to our knowledge, been demonstrated elsewhere is the distribution of the duration of tiktoks. Formally Guinaudeau, Munger & Votta 475 Computational Communication Research Figure 2. Higher Views to Followers Ratio on TikTok constrained to be one minute or less12, the empirical distribution displays a striking bi-modality, suggesting that there are at least two distinct genres of political tiktok in terms of their narrative structure. Figure 1 shows that the majority (57.55% of tiktoks are between 5 and 20 seconds long, peaking at 15 seconds long). There are very few tiktoks between 18 and 55 seconds long, but there is another significant cluster at the very top of the distribution, peaking at 58 seconds long. These results give important context for thinking about how tiktoks differ from YouTube videos that cannot be found in the formal constraints of each. However, there are related data point that we do not have access to: the duration a viewer has to spend on a given tiktok or YouTube video before the platform records this as a “view,” and the way the platforms treat repeat viewership of a given video. The latter is an especially important facet of TikTok, given the possibility of looping a single video over and over again. We thus use the number of views per video reported by the platforms’ APIs in our analysis, but highlight this potential limitation. Panel A of Figure 2 displays evidence that serves as a face validity check of our data: the ratio of views to followers is higher on TikTok than YouTube. We present the following results in the same format: the data for YouTube are shown in red and TikTok in blue, with the shade of each color matching the color of the respective company’s logo. Here, each observation is an account that uploads videos. The statistic of interest is the ratio of the total number of views on all videos uploaded by an account to the number of followers/subscribers that account had at the time we scraped the data. To account for outliers, we calculate the difference in medians of this statistic 476 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video Figure 3. More Commenters Also Create Videos on TikTok between TikTok (median = 113) and YouTube (median = 103) accounts. A Wilcoxon rank sum test indicates this difference is statistically significant and substantively meaningful, W = 8375238, p < 0.001, r = 0.04, CI95%[0.02, 0.06]. The distributions of the ratios are both roughly normally distributed, with the TikTok distribution shifted upwards, indicating that the total views to followers ratio is indeed higher on TikTok (although bootstrapped medians for this metric shows a slight overlap between the 5th and 95th percentile of both distributions as depicted on Panel B of figure 2). Hypothesis Testing We now use these datasets to test the hypotheses proposed above. We calculate differences in statistical significance using non-parametric unpaired two-sample Wilcoxon rank sum tests for group differences and Fisher’s 1925 z-test for correlation coefficient differences, implemented via the R package cocor (Diedenhofen & Musch, 2015). In addition, we used bootstrapping to investigate the sample sensitivity of our results. The bootstrapped results are obtained in the following way: for each platform we sample 20% of the posts for 500 iterations and for each iteration we take the mean of the metric of interest. We then compare the 5th and 95th percentile of the distributions for each platform, considering a result robust if these “confidence intervals” do not overlap. In regards to H1 we find supporting evidence in our data: among accounts that leave comments, the percentage who also upload videos is larger on TikTok than YouTube. Here, each observation represents an account that posted at least one comment on one of the videos in our data. Due to API Guinaudeau, Munger & Votta 477 Computational Communication Research Figure 4. Subscriber Count Explains More of the Variance in YouTube Views cap restraints, we only collect information about the number of videos uploaded by these commenters, not their content. The statistic of interest is the distribution of the number of videos uploaded by these commenters. These distributions are visibly distinct, with a much higher percentage of YouTube commenters never uploading a video. To give an intuition using an arbitrary threshold: on YouTube 18.47% of commenters created at least 5 videos compared to 78.05% of commenters on TikTok. Panel A of Figure 3 shows the distribution of uploaded videos per commenter per platform and Panel B shows the bootstrapped median distributions. The huge gap between the two distributions in Panel B and a non-overlapping 5th and 95th percentile confirm the previously noted pattern that commenters produce more videos on TikTok compared to YouTube. Panel A of Figure 4 displays evidence in support of H2: the relationship between followers and video views is weaker for TikTok. Here, each observation is an account that uploads videos. We run a log-log regression of the number of followers/subscribers each of these accounts had at the time we scraped the data on the median of the number of views on all videos uploaded by that account. In all cases, the subscriber/follower account is able to account for the majority of the variance in views. For YouTube, the R2 is higher than for TikTok. Reviewing the results in greater detail, we can observe that on TikTok a 1 percent increase in followers is associated with a 0.54 percent increase in median video plays per account on TikTok (b = 0.54, r = 0.75, R2 = 0.56). In comparison, we can observe that on YouTube a 1 percent increase in subscribers is associated with a 0.73 percent increase 478 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video Figure 5. Peak-Median Views Ratio is Higher on TikTok in median video plays per account, showing that a follower yields more average video plays on YouTube than on TikTok (b = 0.73, r = 0.82, R2 = 0.67). Next, in order to test whether the higher correlation between followers/ subscribers and average video plays on TikTok is statistically significantly, we again perform a z-test. This is indeed what we find: the difference between the two correlations is statistically significant (z = 6.55, p <.001, rMedianDiff = 0.07, CI95%[0.05, 0.09]). This confirms the relationship expected by H3, although the correlation difference is somewhat small (varying between r = 0.05 and 0.09). In order to test whether this small difference is potentially an artifact of our sample, we calculate the bootstrapped results. Panel B of Figure 4 shows that the mean correlation for YouTube is higher, as shown before. However, the 5th and 95th percentiles of both platforms overlap substantially, which means this difference is not robust to permutations of our sample. Finally, Figure 5 displays evidence that supports H3: the ratio of viewership for a single accounts’ most popular video to their average video viewership is higher on TikTok than on YouTube. Each observation is again an account that uploads videos. The statistic of interest is the ratio of the number of views on their most popular video to the median number of views their videos get. The distributions are similar in shape, but shifted upward for TikTok. The median value of the peakmedian ratio is 64 for TikTok, meaning that the median accounts’ most popular video has 64 as many views as their average (median) video. For YouTube, that number is only 40. A Wilcoxon rank sum test indicates the differences between the Peak-Median Play Ratio on the two platforms are statistically significant, W = 9048077, p < 0.001, r = 0.05, CI95%[0.04, 0.07]. Guinaudeau, Munger & Votta 479 Computational Communication Research Figure 6. Peak-Median Views Ratio is Higher on TikTok The results indicate that the Peak-Median Play Ratio is, as expected by H3, higher on TikTok. As a robustness check to this relatively fragile estimator, Panel A of Figure 6 plots the Gini Coefficients for views for a given account. Checking the values for both platforms, the Gini Coefficient indicates severe inequality with a median of 0.62 for YouTube and a median of 0.70 for TikTok. A Wilcoxon rank sum test indicates the differences between the two platforms are statistically significant, W = 6897485, p < 0.001, r = 0.09, CI95%[0.07, 0.11]. The results indicate that the Gini Coefficient of views is higher on TikTok, meaning greater inequality in the distribution of video views as we would expect in H3. To illustrate the relation, we can also take a look at how many views per account are coming from which percentage of videos: on TikTok the top 20% of the videos rake in 75.76% of views whereas on YouTube the top 20% only get 72.52% of all views per account, on average (mean). Finally, Panel B of Figures 5 and 6 show the bootstrapped mean values of peak-median views ratios and gini coefficients per platforms. The non-overlapping 5th and 95th boundaries between the two platforms suggests the evidence for H4 is robust to our sample construction. Conclusion Both the history of broadcast media and present trends in usage rates suggest that video is the future of social media. YouTube is the current king of online video, and is the most popular social media platform among young people 480 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video in the United States. TikTok offers certain affordances that had never been combined into a single social media platform. Even if Snapchat pioneered mobile-first video editing technology and low-friction video production, it is a fundamentally closed system that relies on users to curate their networks for video distribution. YouTube’s recommendation algorithm allows it to show videos to people who have never opted into the creators of those videos, but as our results have shown, following behavior remains very important. A typical YouTube video is also at least several minutes long and its production requires substantial effort. TikTok concentrates these affordances, which, as our study demonstrates, have significantly shifted the incentives for the creation of video content, the importance of cultivating followers, and the distribution of video popularity for a given account. Through comparison of a dataset of political TikTok metadata with an analogous dataset from YouTube, we find support for each of our theoretically-driven hypotheses. Tiktoks have more views, on average, than do YouTube videos, and the distribution of these views are less easily modeled by looking at an accounts’ follower/subscriber count; the latter finding, however, is dependent on the composition of our samples. The viral potential of TikTok is higher than YouTube, reflected in the former’s higher peak-mean ratio of views and Gini Coefficient of views across a single account. Finally, TikTok is dramatically more successful in encouraging the majority of its users who leave comments to also produce videos, deepening and broadening this engaging behavior. In the aggregate, these results inform our understanding of how the “supply side” of content on TikTok differs from other platforms. Social media platforms are created by their users; TikTok encourages more of their userbase to create televisual content aimed at a large, public audience than any previous platform. More broadly, our experience in conducting this research suggests that TikTok is an unusually dynamic object of study. The platform has seen explosive growth during the 18 months of data we analyze, reflected in the aggregate number of videos produced and the rate at which they have been viewed. This expansion of the userbase beyond the (still prominent) teenagers making dance videos has similarly led to an explosion in variance in how TikTok is used. For example, the prominent Black Lives Matter Protests in the summer of 2020 were reflected in both heated political discussions in the traditional form of bedroom vlogs as well as the kind of evidentiary protest videos more commonly associated with Twitter. The supply-side affordances we discuss above allow TikTok to update the “virality-from-nowhere” so Guinaudeau, Munger & Votta 481 Computational Communication Research distinctive to Twitter to the medium most natural to younger Americans: mobile video. In tandem, fringe or conspiratorial viewpoints have flourished, and the rapid growth of this content suggests that TikTok lacks the organizational capacity of more established platforms to properly mitigate it. Furthermore, the same frictionlessness that has enabled its rapid growth and which entices so many different individuals to upload videos has proven to be a vector for attacks: in September 2020, a coordinated group uploaded thousands of minorly edited versions of a suicide. TikTok’s ease of account creation, the auto-play video feed, and the powerful recommendation algorithm that enlists viewers to informally vet videos from unknown accounts made its users more vulnerable to accidental exposure than other platforms’. There is thus another through line to our theoretical and empirical results. Yes, TikTok has encouraged a wider range of users to produce video content. But the same affordances that accomplish this also allow the platform to claw back a significant amount of the “producer surplus” that accrues to the content creators on other platforms. These platforms—as well as traditional media firms—are dependent on their most successful content creators to attract consumers. But if these superstar creators are sufficiently popular, they can credibly threaten to leave the platform and take their audience with them. The theoretical motivation behind each of the three hypotheses we test can be understood as contributing to TikTok’s leverage over their creators. H1, that more commentors will also upload videos, means that the pool of potential creators is larger; you might be a good dancer, but you’re not that much better than hundreds of other dancers waiting in the wings. H2 shows that follower counts are less predictive of video viewership. The user experience of the platform downplays the importance of creator accounts or pages, and TikTok’s opaque algorithm mediates and obscures the relationship between creator and consumer. H3 demonstrates a far greater dispersion in the viewership of a single account’s videos on TikTok; for the creator, this means less security in the size of their audience and a consistent pressure to seek out viral hits. Tying these effects together, we explain much of TikTok’s explosive success as a product of the aggregate effort it has enticed creators to exert, at both the “extensive margin” (more different creators) and the “intensive margin” (more effort from each creator). Clearly, the media product the platform offers consumers is novel and engaging—but the “demand side” is literally only half the story. TikTok’s novel combination of affordances, including its powerful algorithm, also affects the “supply side.” 482 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video One implication of this analysis—which cannot be tested yet, because TikTok has not been around long enough—is that there will be more churn among the popular accounts than on other social media platforms. Creators can blow up overnight, but this dynamic is always in effect; the competition is concomitantly more intense. This expectation is borne out qualitatively. In a journalistic account of hype houses, Swason, 2021 describes “The Anxiety of Influencers.” The teenage superstars of the app are recruited from across the country; in the words of a press liason for one of the hype houses, “You can be in Cleveland, Ohio, alone in your bedroom, and you can get a million followers overnight.” But staying on top requires constant, exhausting effort. The long-term winners are the people running things. The CEO of one of the hype houses admits that “Only a very small percentage of these kids will actually make it in the industry; the rest of them…will eventually just ’cycle through.’” Fifteen seconds is the modal length of a tiktok, but it also reflects the extent of what TikTok is offering its millions of content creators. Author Note Correspondence concerning this article should be addressed to: Benjamin Guinaudeau, Department of Politics and Public Administration, University of Konstanz, Universitätsstraße 10, 78464 Konstanz, Germany. E-mail: [email protected] Notes 1. 2. 3. 4. 5. https://appfigures.com/resources/insights/most-downloaded-mobileapps-2020 We encourage the use of capital-T TikTok to describe the platform and lowercase-t tiktok for each individual post. For replication data and code visit the Open Science Framework (OSF) project page: https://osf.io/c4xq3. In July 2021, TikTok increase the limit from 60 seconds to three minutes. The platform also allows users to create tiktoks that are longer than three minutes if they use video uploaded from another source. This seems to be a rare practice. It bears mention that TikTok’s global userbase is far larger: it grew from 55 million MAUs to 689 million MAUs in that same time period. These figures do not include any users in China, where Bytedance operates TikTok’s sister company Douyin. Guinaudeau, Munger & Votta 483 Computational Communication Research 6. 7. 8. 9. 10. 11. 12. One crucial component of TikTok’s explosive growth is the global pandemic that hit the US in March 2020. The dominance of the in-home, vlog style of tiktok is related to the inability to leave the home. This uncertainty about viewership mimics the logic of variable rewards that BF Skinner found to be the most effective schedule for operant conditioning. This insight has long been used by designers of machine gambling devices to optimize their slot machines for addiction (Schüll, 2014), and has more recently been applied to video game “loot boxes” (where rewards for achievements take the form of a random prize), which have also been shown to have an addictive quality (Drummond & Sauer, 2018). For more detailed information on the retrieving procedure, please look at the documentation of tiktokr (https://github.com/benjamingui Also notice, that following a security update in the last quarter of 2020, it is now much more difficult, if not impossible, to exploit the API to retrieve large amount of data. Religious groups constitute big communities on TikTok and who frequently mention keywords vaguely related to politics, but not primarily so (for example: “public prayer” or “biblical”). This kind of religious content was found to mostly not relate to politics and was thus coded as non-political. Most of the accounts were coded by at least by two different coders. InterCoder Reliability was 0.87. Interestingly, the videos in the two datasets gathered a similar amount of comments (400 comments/video for YouTube and 350 comments/video for TikTok). In July 2021, after the completion of this study, TikTok increased its video length limit from 60 seconds to three minutes References Abidin, C. (2018). Internet celebrity: Understanding fame online. Emerald Publishing Limited. Caplan, R., & Gillespie, T. (2020). Tiered governance and demonetization: The shifting terms of labor and compensation in the platform economy. Social Media+ Society, 6(2), 2056305120936636. Chang, C., & Masterson, M. (2020). Using word order in political text classification with long short-term memory models. Political Analysis, 1–17. Diedenhofen, B., & Musch, J. (2015). Cocor: A comprehensive solution for the statistical comparison of correlations. PloS one, 10(4), 1–12. Drummond, A., & Sauer, J. D. (2018). Video game loot boxes are psychologically akin to gambling. Nature Human Behaviour, 2(8), 530–532. Fisher, R. (1925). Statistical methods for research workers. London: Oliver and Loyd, Ltd, 99–101. Hindman, M. (2008). The myth of digital democracy. Princeton University Press. 484 VOL. 4, NO. 2, 2022 Fifteen Seconds of Fame: Tik Tok and the Supply Side of Social Video Hindman, M. (2018). The internet trap: How the digital economy builds monopolies and undermines democracy. Princeton University Press. Karpf, D. (2012). Social science research methods in internet time. Information, Communication & Society, 15(5), 639–661. Lewis, R. (2020). This is what the news won’t show you”: Youtube creators and the reactionary politics of micro-celebrity. Television & New Media, 21(2), 201–217. Literat, I., & Kligler-Vilenchik, N. (2019). Youth collective political expression on social media: The role of affordances and memetic dimensions for voicing political views. New Media & Society, 21(9), 1988–2009. Munger, K. (2020). All the news that’s fit to click: The economics of clickbait media. Political Communication, 37 (3), 376–397. Munger, K., & Phillips, J. (2020). Right-wing youtube: A supply and demand perspective (forthcoming). International Journal of Press/Politics. Salganik, M. J., Dodds, P. S., & Watts, D. J. (2006). Experimental study of inequality and unpredictability in an artificial cultural market. science, 311(5762), 854–856. Schüll, N. D. (2014). Addiction by design: Machine gambling in las vegas. Princeton University Press. Serrano, J. C. M., Papakyriakopoulos, O., & Hegelich, S. (2020). Dancing to the partisan beat: A f irst analysis of political communication on tiktok. arXiv preprint arXiv:2004.05478. Settle, J. (2018). Frenemies: How social media polarizes america. Cambridge University Press. Stokel-Walker, C. (2020). Inside tiktok’s latest big pitch to advertisers with new numbers showing time spent on the app and engagement metrics. Business Insider, https://www.businessinsider.com/leaked-tiktok-slides-engagement-timespent-activity-2020-9. Swason, B. (2021). The anxiety of influencers. Harpers Magazine, June. Thorson, K., Edgerly, S., Kligler-Vilenchik, N., Xu, Y., & Wang, L. (2016). Climate and sustainability| seeking visibility in a big tent: Digital communication and the people’s climate march. International Journal of Communication, 10, 23. TikTok. (2019). Tiktok transparency report 2019 h2. TikTok Safety Center: https:// www.tiktok.com/safety/resources/ report. TikTok. (2020). Tiktok transparency report. TikTok Safety Center: https://www.tiktok. com/safety/resources/transpar report-2020-1. Tufekci, Z. (2018). Youtube, the great radicalizer. The New York Times, 12, 15. Wang, Y. (2020). Influence of camera view on tiktok users’ presence, immersion, and adoption intent. Computers in Human Behavior, 106373. Guinaudeau, Munger & Votta 485

Fifteen Seconds of Fame: TikTok and the Supply Side of Social Video PDF

Document Details

Tags

Related

Summary

Full Transcript

Upgrade to continue