You're talking about data that doesn't back the initial hypothesis. That isn't bad data in this context, and you're correct that it is still valuable for reforming hypotheses and re-running the experiment.
Bad data in this context is referring to data quality - things like inconsistent collection, inadequate/missing data, free text vs controlled input, etc. In those cases the data can become almost useless (and this is usually known by the people working on a project but not necessarily by their management). This causes pressure to turn shit into gold when that just isn't possible.
Imagine that your boss wanted you to predict what the temperature will be next Tuesday. In order to do this, your company has provided you the temperature from every Tuesday for the past 12 years. If that wasn't bad enough, at first they recorded the date in DDMMYY format but 10 years ago they switched to MMDDYY. However, some records were still collected in the legacy DDMMYY format due to lack of training in the temperature collection department, and there is no way to distinguish the correct date. Also, one employee who was close to retirement only collected the temperature as "Hot" or "Cold" because that is how he was trained to do it when he was first hired 50 years ago and he never bothered to learn the new system. Now, you can probably build a model that tracks weekly temperature over time and approximates the next Tuesday's temperature based on something like seasonality, the historical average, and the most recent Tuesday. But you'll know that it's not the best estimate, you'll know there is way better data out there, and you'll probably be able to make a simpler, more accurate estimate just by averaging the temperature from Saturday/Sunday/Monday.
Right, I do despise the Democrats but I still voted Harris for harm reduction. If anything, these 4 long years mentioned above are just going to make me despise Democrats more for fumbling the election so badly
He thought a "horse bath" was just a quick rinse off in the sink. He was inadvertently teaching ESL elementary school kids the phrase "whore's bath" which, while it is technically just a quick rinse in the sink, there is definitely different connotation.
"A friend of mine is a non-native English speaker. He teaches at an elementary school and works with ‘English as a second language’ students. He casually mentioned that he always tells his students to take a ‘horse bath’ in the bathroom sink after recess if needed. He was traumatized when I told him that he’d misheard that phrase for his entire adult life."
I mentioned this in a comment last week and was called a sexist for not supporting women, because I dared to say a Cheney endorsement was bad for Dems.
This has nothing to do with her gender. In fact, I just said 'a Cheney.' Dick Cheney also supported Kamala and that made people want to vote for her even less than Liz did. The fact that Kamala's positions are so far to the right that known war hawk Dick Cheney threw his support behind her is a BAD thing for a lot of left wing voters.
We weren't talking about people who voted for Trump instead of Kamala. We are talking about 15 million people who didn't show up because there was no one running that supported their values.
Right, "did Biden drop out" had a spike as seen in the first picture below. It's hard to tell magnitude. When comparing to another phrase, it's easy to see that the spike wasn't even close to the spike for another election day phrase: 'who is Kamala?'
Also, don't they need to run to move food through their digestive tract? Or to force themselves to cough if they have something stuck in their lungs? I think there is some sort of dependency of basic functions that relies on the movement of their lungs/stomach going back and forth while running that they can't easily do if they just stand in one place all day
Have you thought of trying to pick up another language? Started learning Spanish 4 years ago and now I can go on vacation and have conversations with locals. Also, I'm more interested in their local history because I can read it/listen to it in Spanish and practice the language at the same time.
I mean, it's pretty well documented how awful Christopher Columbus was. Even in the context of the time period: he was arrested in the new world and shipped back to Spain for a trial because he was so ruthless in his treatment of the native peoples. The myths about him being a 'great man' are all only like 100 years old.
You're talking about data that doesn't back the initial hypothesis. That isn't bad data in this context, and you're correct that it is still valuable for reforming hypotheses and re-running the experiment.
Bad data in this context is referring to data quality - things like inconsistent collection, inadequate/missing data, free text vs controlled input, etc. In those cases the data can become almost useless (and this is usually known by the people working on a project but not necessarily by their management). This causes pressure to turn shit into gold when that just isn't possible.
Imagine that your boss wanted you to predict what the temperature will be next Tuesday. In order to do this, your company has provided you the temperature from every Tuesday for the past 12 years. If that wasn't bad enough, at first they recorded the date in DDMMYY format but 10 years ago they switched to MMDDYY. However, some records were still collected in the legacy DDMMYY format due to lack of training in the temperature collection department, and there is no way to distinguish the correct date. Also, one employee who was close to retirement only collected the temperature as "Hot" or "Cold" because that is how he was trained to do it when he was first hired 50 years ago and he never bothered to learn the new system. Now, you can probably build a model that tracks weekly temperature over time and approximates the next Tuesday's temperature based on something like seasonality, the historical average, and the most recent Tuesday. But you'll know that it's not the best estimate, you'll know there is way better data out there, and you'll probably be able to make a simpler, more accurate estimate just by averaging the temperature from Saturday/Sunday/Monday.
That's bad data.