Tumgik
#but don't go spreading misinformation
5ummit · 5 months
Text
AO3 Ship Stats: Year In Bad Data
You may have seen this AO3 Year In Review.
Tumblr media
It hasn’t crossed my tumblr dash but it sure is circulating on twitter with 3.5M views, 10K likes, 17K retweets and counting. Normally this would be great! I love data and charts and comparisons!
Except this data is GARBAGE and belongs in the TRASH.
I first noticed something fishy when I realized that Steve/Bucky – the 5th largest ship on AO3 by total fic count – wasn’t on this Top 100 list anywhere. I know Marvel’s popularity has fallen in recent years, but not that much. Especially considering some of the other ships that made it on the list. You mean to tell me a femslash HP ship (Mary MacDonald/Lily Potter) in which one half of the pairing was so minor I had to look up her name because she was only mentioned once in a single flashback scene beat fandom juggernaut Stucky? I call bullshit.
Now obviously jumping to conclusions based on gut instinct alone is horrible practice... but it is a good place to start. So let’s look at the actual numbers and discover why this entire dataset sits on a throne of lies.
Here are the results of filtering the Steve/Bucky tag for all works created between Jan 1, 2023 and Dec 31, 2023:
Tumblr media
Not only would that place Steve/Bucky at #23 on this list, if the other counts are correct (hint: they're not), it’s also well above the 1520-new-work cutoff of the #100 spot. So how the fuck is it not on the list? Let’s check out the author’s FAQ to see if there’s some important factor we’re missing.
The first thing you’ll probably notice in the FAQ is that the data is being scraped from publicly available works. That means anything privated and only accessible to logged-in users isn’t counted. This is Sin #1. Already the data is inaccurate because we’re not actually counting all of the published fics, but the bots needed to do data collection on this scale can't easily scrape privated fics so I kinda get it. We’ll roll with this for now and see if it at least makes the numbers make more sense:
Tumblr media
Nope. Logging out only reduced the total by a couple hundred. Even if one were to choose the most restrictive possible definition of "new works" and filter out all crossovers and incomplete fics, Steve/Bucky would still have a yearly total of 2,305. Yet the list claims their total is somewhere below 1,500? What the fuck is going on here?
Let’s look at another ship for comparison. This time one that’s very recent and popular enough to make it on the list so we have an actual reference value for comparison: Nick/Charlie (Heartstopper). According to the list, this ship sits at #34 this year with a total of 2630 new works. But what’s AO3 say?
Tumblr media
Off by a hundred or so but the values are much closer at least!
If we dig further into the FAQ though we discover Sin #2 (and the most egregious): the counting method. The yearly fic counts are NOT determined by filtering for a certain time period, they’re determined by simply taking a snapshot of the total number of fics in a ship tag at the end of the year and subtracting the previous end-of-year total. For example, if you check a ship tag on Jan 1, 2023 and it has 10,000 fics and check it again on Jan 1, 2024 and it now has 12,000 fics, the difference (2,000) would be the number of "new works" on this chart.
At first glance this subtraction method might seem like a perfectly valid way to count fics, and it’s certainly the easiest way, but it can and did have major consequences to the point of making the entire dataset functionally meaningless. Why? If any older works are deleted or privated, every single one of those will be subtracted from the current year fic count. And to make the problem even worse, beginning at the end of last year there was a big scare about AI scraping fics from AO3, which caused hundreds, if not thousands, of users to lock down their fics or delete them.
The magnitude of this fuck up may not be immediately obvious so let’s look at an example to see how this works in practice.
Say we have two ships. Ship A is more than a decade old with a large fanbase. Ship B is only a couple years old but gaining traction. On Jan 1, 2023, Ship A had a catalog of 50,000 fics and ship B had 5,000. Both ships have 3,000 new works published in 2023. However, 4% of the older works in each fandom were either privated or deleted during that same time (this percentage is was just chosen to make the math easy but it’s close to reality).
Ship A: 50,000 x 4% = 2,000 removed works Ship B: 5,000 x 4% = 200 removed works
Ship A: 3,000 - 2,000 = 1,000 "new" works Ship B: 3,000 - 200 = 2,800 "new" works
This gives Ship A a net gain of 1,000 and Ship B a net gain of 2,800 despite both fandoms producing the exact same number of new works that year. And neither one of these reported counts are the actual new works count (3,000). THIS explains the drastic difference in ranking between a ship like Steve/Bucky and Nick/Charlie.
How is this a useful measure of anything? You can't draw any conclusions about the current size and popularity of a fandom based on this data.
With this system, not only is the reported "new works" count incorrect, the older, larger fandom will always be punished and it’s count disproportionately reduced simply for the sin of being an older, larger fandom. This example doesn’t even take into account that people are going to be way more likely to delete an old fic they're no longer proud of in a fandom they no longer care about than a fic that was just written, so the deletion percentage for the older fandom should theoretically be even larger in comparison.
And if that wasn't bad enough, the author of this "study" KNEW the data was tainted and chose to present it as meaningful anyway. You will only find this if you click through to the FAQ and read about the author’s methodology, something 99.99% of people will NOT do (and even those who do may not understand the true significance of this problem):
Tumblr media Tumblr media
The author may try to argue their post states that the tags "which had the greatest gain in total public fanworks” are shown on the chart, which makes it not a lie, but a error on the viewer’s part in not interpreting their data correctly. This is bullshit. Their chart CLEARLY titles the fic count column “New Works” which it explicitly is NOT, by their own admission! It should be titled “Net Gain in Works” or something similar.
Even if it were correctly titled though, the general public would not understand the difference, would interpret the numbers as new works anyway (because net gain is functionally meaningless as we've just discovered), and would base conclusions on their incorrect assumptions. There’s no getting around that… other than doing the counts correctly in the first place. This would be a much larger task but I strongly believe you shouldn’t take on a project like this if you can’t do it right.
To sum up, just because someone put a lot of work into gathering data and making a nice color-coded chart, doesn’t mean the data is GOOD or VALUABLE.
2K notes · View notes
royalarchivist · 3 months
Text
Quackity: These past days I've been in many calls, and I'm not done yet. I've spoken to a lot of people and creators. I've read your comments and I'm well aware of what needs to be done to carry out this project. I want to tell you all, beforehand, that for me the team's well-being is fundamental. I'm very involved in this topic to sort it out and I want to make that very clear. I want to tell you something... I want to tell you all that the administrative staff responsible for so much harm to the project has been fired. Specifically, those who made decisions without my permission, affecting the administrative and financial area of the project. Consequently, after this, I was in charge of doing a financial analysis that's carrying out for the QSMP.
Guys, to be really honest, it was not going to last. Therefore, I've had to make deep drastic structural changes that have lead me to reduce the performance of the server down to the most essential, and this is in order to ensure the well being of everyone involved in it. Having said this, I want to give a very important update: I want to let you all know that the QSMP will have to slow down temporarily. This is to ensure this new structure adapts to the project, because it's a restructuring that's taking place. I'm letting you know, and I reiterate, there are no voluntary positions inside the QSMP.
At the moment, there will not be any more individual update accounts of all 5 existing languages in the project. In any case, during this transition, there's going to be a temporal absence of all Eggs and NPCs. I know these are difficult changes, and I repeat, it's temporary until we adjust to these new conditions that will improve the performance of this new structure that's being made from scratch, both in the administrative and financial part. I'd like to reintegrate people fro the QSMP as time goes by if a financial viability can be found for the project Taking advantage of this update to tell you guys that within the changes of the server as it is, creators will have full control of their lore and stories. The team will not intervene in the way that it was being done. Moreover, efforts will be made to change the competitive dynamics inside the game so as to ease up the game style for the creators. Like I'm saying, all of these changes, and more, are being carrying out to have the project as best as possible, and they're being done little by little. This is a whole new structure that will ensure the best continuity and experience for the creators, the community and the team behind.
Guys, I want to make very clear that this is restructuring process, and again, it's not a fast one. The server being open does not mean everything's perfect, I understand that very well. Conversations will keep taking place, communication will continue and the constant improvement of the project as well. I ask, please, for everyone's patience and understanding regarding all changes. Please do wait for official announcements since a lot of incomplete and incorrect information is being spread. I want to tell you all something- if you don't trust in these changes or have many doubts about it, and don't want to consume any more of the project's content, I understand 100%. I have a personal commitment with the QSMP and I will work until it functions in the way it is supposed to do.
Lastly, I want to let you know that it was being worked on for months on finalizing the integration of Korean creators to the QSMP. For that reason, tomorrow we will be welcoming the new Korean creators of the QSMP, of course, taking into account all the changes I've just mentioned. I hope you can give the new Korean members warm welcome to the project. And as you know, their schedules are earlier. For everyone who would like to watch, they will be joining at 11am Mexico time and at 9am US time. Basically, I wanted to give that update regarding everything that's being done within the project. Again, thank you for your patience and understanding- these are necessary changes and I'm glad they're being done now. And many more things will keep being adjusted.
via @QuackitySubs
784 notes · View notes
alonelystargazer · 2 months
Text
the fact that this is where jjk was originally meant to end makes me believe that jjk might end like this, with Yuji at the end of his life, cursing Megumi to live on, alone, bc everyone he cared about is gone
Tumblr media
another terrible and angsty thought that will not happen but I'm thinking it anyway: what if yuji is able to get megumi out of the abyss but ends up dying, and seeing the last person megumi cared about die causes him to initiate the merger
22 notes · View notes
definitelynotnia · 4 months
Text
normalise saying "I don't have enough information on this topic to have an opinion on it"
and then staying out of it completely rather than going along with wtv opinion u imprinted from three random posts/reels/tweets and having weird misinformed debates with full confidence
#like bro it's ok to not have an opinion on something if you don't know just say i don't know and move on#there's too many fucked up things going on in the world it's perfectly natural to not have proper information on a topic of debate#just remove yourself from said debate theres no needto go marching in with limited information and spreading even more misinformation#i see so many people around my age posting random political stuff be it religious or about lgbtq or women even and they haven't read#a single article about any of these topics ever#their only source is sketchy social media posts or “dark jokes” about a certain community making them think it's cool to shit on them#or random “sigma” edits of things and suddenly the most random stuff becomes everyone's favourite mainstream political affiliation#like have you read a single policy pertaining to this government or do you have a single reason for violently hating a certain community#i understand that some people are genuinely interested in these topics and that is absolutely wonderful it's great that young people have#opinions and commentary on world issues but only when this stems from an area of genuine interest and when at least some effort to be#factual is made not when it's only done because everyone else is doing it and they have some weird sort of fomo at work or they just think#it's funny or wtv without understanding the implications of their words and actions#no one is forcing you to involve yourself in every social issue but the moment you choose to make commentary on a social issue you must take#the responsibility of educating yourself as best you can before you open your mouth
17 notes · View notes
suncaptor · 3 months
Text
never trust any narrative about medicine that talks in 100% absolutes.
#like it does not work that way.#there's degrees of safety and side effects and likelinesses that should be weighed compared to alternatives#sometimes the risk is so low it's not super worth even worrying about but negating risk at all is still a form of misinformation#you tell someone 'there's a tiny miniscule chance you could have x disease but even if you do we are researching ways to handle and fix tha#potential side effect but if you DON'T take it the chance you will have y disease is idk 10.000 times higher'#MUCH more convincing than 'this is 100% safe and necessary considering y and you're stupid to question if it's safe at all'#especially when like. the latter is literally factually untrue so there WILL be proof against it right.#the proof against it does NOT mean it's going to be statistically relevant to the general population#but if the only people who are taking it seriously are also people spreading misinformation!#then that can just be. weaponised.#whatever#incoherents#this goes for treatment not just preventative too like. if someone says ANYTHING is 100% safe well! though if you're actually#like with a doctor and they say something's VIRTUALLY completely safe that's different I am more talking about studies#bc the doctor is using language of what things actually would matter to you as a general person in their populace#however you're still entitled to know potential side effects even if they're rare and usually that's something a pharmacist legally can#explain and will be listed in the paperwork around it
7 notes · View notes
obstinatecondolement · 4 months
Text
I was watching a YouTube video about making zines and the person ended it with a call to action to be informed about Palestine and was like "And what better way than by reading zines???" There are. Um. Better ways...
14 notes · View notes
kamiitsubakii · 5 months
Text
I'm in a mood. Anyways since I'm kind of vocal about being diagnosed with DID I want to make it clear that endogenic "systems" and tulpa "systems" are NOT welcome on my page. In order to be a system (whether it's DID, OSDD, or UDD) you need to go through repetitive trauma before the age of 9/10. The point of having a system is to protect you from the trauma you experienced at a young age. So no you cannot be a system without trauma. So please dni. Which also means, DON'T fucking come into my comments harassing me on how I'm wrong.
12 notes · View notes
wowbright · 11 months
Text
Ugh.
9 notes · View notes
irlwakko · 2 years
Text
friendly reminder: having a fear of roller coasters/rides for any reason is completely valid, but spreading misinformation about the safety of roller coasters/rides is NOT <3
roller coasters and rides of all kinds are incredibly safe.
62 notes · View notes
anti-transphobia · 11 months
Text
Forever annoyed that "don't speak over marginalized people", the notion that marginalized people are already spoken over, and their oppressors need to actually listen and learn before speaking about complicated topics and need to do so in support of not OVER them, so quickly turned into "I'm not x so I can't speak on x issues". Like the "don't speak on this if you're not this" started out so well meaning because it was about people needing to actually take the time to learn before talking about issues they didn't previously understand! Now it's just an excuse for people to never learn about the issues minorities face or to actually stand up for them in any meaningful way
#forming an opinion is so natural and also important. you can't just stay 'neutral' on everything just because marginalized groups arent#a collective that either fully agrees or fully disagrees with something#you will always have 'lol im x and i dont care about bigotry' folks. always. always always always#you've gotta use your god damned brain and do what's right instead of going 'im not allowed to have an opinion on this'#it's literally just looped around to ignoring issues again. like saying 'racism is bad' isnt good enough when you stay quiet#when your friend is being racist because they're a poc being racist to another poc#and that situation is too 'unclear' for you#ive seen that happen a ton. fucking get over it. yes they're going to respond negatively to being called racist literally everyone does#get over that fear of backlash and stick up for people!!!#this is why radqueers are a plague. their entire stance is 'we dont care enough to think so everything is good and okay'#and has done horrible shit like spread RAMPANT misinformation about mental disorders such as DID#which makes life so much harder for people with DID. and all disorders as they get romanticized instead of actually understood#so the people with the '''bad symptoms''' get shunned#the amount of times I've heard horror stories of actual systems getting abused and forced into all kinds of shit because of endos.......#anyway neutral stances are for things that don't really hurt people or dont matter or#for when youre in the position of actually learning and forming a position#which in that case its meant to be temporary. temporary!!!!!#radqueers dni
6 notes · View notes
rubberduckyrye · 4 months
Text
Man people in the Genshin player base who are tired of Genshin need to like. Just stop playing.
No, that free skin selector is not to incentivize players to buy that Genshin Themed PS5. It's a perk of buying that PS5.
You should not be buying a PS5 for some throwaway primogems and a free paid skin selector.
I know everyone's mad because the data miners said the skin selector was going to be free for everyone, but. Well guess what? That's kind of what happens when you get data miners. And why I've even stopped watching the Livestreams.
Because spoiling any aspect of the game just kind of ruins it for me. The free skin? Had I, maybe, not invested in Klee's skin, I would be annoyed about learning that what I thought was going to be my chance to get a free Klee skin was a lie, but I'd still be more annoyed at the data miners and the people spreading it around than HYV. Like they're the ones who lied, HYV wasn't taking back a free gift or something.
Again.... if you're not having fun with the game, stop playing. You're not doing anyone any favors by not enjoying yourself and making it every one else's problem.
5 notes · View notes
nysus-temple · 1 year
Text
It's incredible, in the wrong way, that this needs to be said, but...
Wikipedia is not a primary source.
Look, I use it too, sometimes it can work, but you can't jump at anyone going “did you know that—” and then answering that you read it on Wikipedia. Most of the places where the info comes from are not primary sources, but secondary ( or even not true ), from later authors. And unless those authors list their actual primary sources for their works, then they're not reliable.
So, please, don't use the well-known “argument of the ignorant” to justify your facts you made up. For the ones who don't know, that argument consists in "if there are no sources saying that THIS didn't happen, then is possible it has happened and we don't know!" Those, dear, are theories, not sources nor arguments.
I know getting primary sources is hard, but that doesn't justify misinformation, sorry if that broke your heart into pieces, buddy.
18 notes · View notes
the-blackdale · 1 month
Text
..
1 note · View note
ratgirlcopia · 6 months
Text
makes a 3-hour long hbomberguy-style video that's just me trying to trace the origins of the statement that copia's favorite drink is evaporated milk.
4 notes · View notes
wild-at-mind · 6 months
Text
Honestly really upset about the James Somerton thing.
4 notes · View notes
blorbocedes · 1 year
Text
i have no source for that literally just made it up🤭
12 notes · View notes