0104 Matt Alice

已完成

创建时间: 2026-02-05 08:51:47

更新时间: 2026-02-05 09:41:45

源文件: f0.mp4

文件大小: 0.00 MB

字数统计: 38,247 字

标签:
暂无标签
处理统计

STT耗时: 29151 秒

分析耗时: 9 秒

处理流程
文件上传 完成

文件名: f0.mp4
大小: 0.00 MB

试听当前项目录音
URL直链 f0.mp4
时长: 检测中...
视频加载中,请稍候... (来自外部URL,可能需要较长时间)
语音识别 (STT)
完成
Okay. It's the amount of people do it, not the actual it's not the chance of the thing happening. It's the number of people that do the thing. We're measuring. The number of people doing the thing. Yeah. So in this one it's the number of people who say they support her, okay? So that we're interested in her level of support. The thing we're testing the test statistic is the number of successes, okay? So it's always the number of people that give a success or the number of things in always people is I guess number of successes, okay? In a in example one, sorry, it was the number of heads because we were looking at cand if it was biased about heads on up. In this one, we're interested in seeing if the support for this candidate is 40% or isn't 40%. So the test statistic is the number of people who do support her. Successes, Yeah, pretty much. Yes, it's always the number of successes. But if they ask you that they want it in the context of the question, okay. So yes, it's the number of successes, but you look at that and go, okay, what's the success? Success in this person's eyes is someone supports her. Yeah, okay. Okay. And then number part b, the hypotheses. So what's our normal ll hypothesis is going to be. It's us probability of a of a person supporting those 0.4 really yes, p equals 0.4. And then what about the alternative hypothesis? It's that this one's hard. It's that the probability does not equal to 0.4. So this is actually this is still a one tail test. So this is either a gross than or a less than Oh, but it's really it's actually really, really tricky. But where like which way it is based on the in the word we're looking at is that because that's the key part we want to see, we think they're overestiating what the researcher does. So if she's overestimating, would that make p greater than 0.4 or less than 0.4? I'll make it less than point for Yeah good. Yeah so our hypotheses are that completely get what you said not equal to and some of them will be that but we get onto the two side tests a bit later because they don't they don't quite come just yet. But with your alternative hypothesis, it is always it's just going to be one sneaky word in the question such as overesmates when you go okay, so that means p is less than 0.4 for the alternative hypothesis. Okay, brilliant. And then c, we've not looked at this sort of question yet. Okay. What can we lithe when we talk to that full, full question? For example one, but we didn't have a question like this in example one. So explain the condition under which the null hypothesis would be rejected. So if we were to do this hypothesis test, go. Oh, no, that is okay. Okay. So if we were to do this hypothesis test, what would we do next? And then we just trying to like just use the thing good. And what would we actually be trying to find out? So would say, first of all, x is bonly distributed, wouldn't we? Yeah have we got a number of trials? 2020 good. And then what do we put for probability? Yeah, good. We assume the normal ll hypothesis is. And then what we're actually trying to work out because this is the key part, what do we want the probability of. Oh, of the support of the ender. Sorry of like so in the question, we're out of those 20 people, how much support did they have? Three over 20I did that three. So something to do with three because three is the key bit here. But jowe did the coin. One. We we talked through it, but we didn't say we want x to. So I think we said there were five heads. We didn't work out x equals five. We did something else. Oh, bigger than five. Thanks, kes. Bigger than five? Yeah. Why do we do bigger than five? Because that's the thing that we want to test. So it's not always bigger than five. It won't always be greater than but so why did we go greater than last time? Because do you want to see if the coin is biased towards so do we want a greater than this time or do we want a less than this time first then okay. Why no? I mean, 嗯,Oh my God. It's. The word have be used. So we just want to prove that she's overestimating, but only the other 20 people support them means that. It's already my estimate here, right? So we're under her estimate Yeah because 40% of the 20 would be over three be alive over actually. But we don't just work out the probof, just three. So the word in that we use is always three or more extreme. Okay, so in which direction is more extreme? Less than three. Yeah, good. We want x is less than three. Okay? And look, the way you work that out is it's always away from the expected value. So you could do, which I think you've been doing sort of mentally 20 times. Point four gives you eight. So in a perfect world, if she's got 40% of the support, then eight people would say she's their supporter. Okay? So it's always away from that. What that expected value that mean if you want to call it a mean, okay, same within the coin one. Then we add 0.5 and we did eight toss is you expected four. So when it was five and we wanted five or more extreme, that's why it was greater than five. Good. Okay. So Yeah, we do this. We don't we don't actually have to do it in this question, but we get some number for that. We work out that we get some probability, then what do we do? Then we do, and then we compare it to the significance level. Brilliant. We compare it with that 5% significance level. And then that takes us to this question. So what probability do we need to be able to say it rejects the null hypothesis? It rejects the north. Yes. So what probability do we need here as an answer compared to our significance level to be able to reject the fact that she has 40% of the support. We need to be than brilliant. Yes, exactly that. Yes. If we get a value under 5%, thatwill say, well, essentially what that does is we go, okay, if she's got force parensupport the chance that three out of 20 or more extreme support her is really, really unlikely to happen. Therefore, if it's really, really unlikely to happen under the null hypothesis, we assume that the null hypothesis isn't right. Okay? We can't say with certainty because this could happen. You know I could three people out of 20 could say the supporter even at 40%. But if it's really, really unlikely, we go, Yeah, it's probably not happened. She's probably got into that because exactly what they've said here. Okay, so they've said the non ll hypothesis will be rejected, the probability of three or fewer. So that's what we got. People say they support the candidate. And if it's less than 5% again, then that's what you said. Okay. Given that this, yes, we always assume the knowledge bothesis is brilliant. And then seven a is just all about can you explain how a hypothesis test works? Can you find the two hypotheses? Can you define the test statistic? Which Yeah, I think I think you can. There's not there's not a massive amount to these to be honest. They get woryet. But these are just like the two examples we've done that are wordy. I want to skip 7.2 with fact. First, I want to talk through where Marks are on. So if we had this question and it was instead of abc, let's imagine the question was just test at the 5% level of significance the researchers claim then. So there's a markup for grants for the hypotheses. So that's mark one. Now you do that nothing to actually get one out of four, okay? They're usually at four, maybe five depending on how Nast sty it is. Okay, you should also you should define the test statistic. So you should always go x is number of people who support. But it's it's not necessarily worth a mark. Okay, but it's good practice for them. Your next solid mark is for working out this probability. Okay? So if we went, Oh Yeah, that equals, I don't know, let's pretend it, 0.04. Okay? If it is that, that's mark two. So you actually do the matyou get the second mark? Okay, the third and fourth mark come for your conclusion. So it's that paragraph that you write at the end. So the third mark comes for correctly compare it with the significance level to give the correct outcome. So sort of like in c, so let's say it was zero zero four yougo. Oh, this is less than 5%. So we can reject the null hypothesis because that's mark three. And then the fourth mark is to put into the context of the question. So you just have to look at the question and go, okay, so if let's imagine we do reject the non hypothesis, you could then say there is evidence to suggest that the election candidate has less than 40% of the residenboard. Okay? So not not too difficult to do, but that bit changes a lot per question because you've got to use whatever scenario they're given you. So this test statistics 's not necessarily worth the mark. Same with this line. But you still want to write them in because they're valid to your question. Okay. So at this point, I like to skip critical values, and I like to actually look at the full hypothesis tests, because I think once we've talked through the whole process and done the first half the process, I think it makes more sense to do the second half of the process essentially, which is where this one comes up. So one tatests, we'll start one Tawe click, two tathis is basically bullet points for how it works. We've already talked through this, but Yeah, so you you formulate a model for your test statistic, you identify your hypotheses, you work out your probability of the thing happening, you compare it with you stiits level, and then your your conclusion. Okay, as this bit says down here, you can technically also do it with critical regions and then see if your observe value falls within the critical region. However, since your calculator does all the nasty stuff for you and youhave to use a table, it's quicker not to use critical regions. Okay? So back when I did this, critical originons was a genuine, valid method because youlook up on a table and yousee all the values there anyway whereas you you you don't need to do that. So if you can avoid using the table and critical regions, I would recommend you do. Okay, so you'll see the method will be here and we'll we'll come to it eventually, but I don't think it's as useful as the textbook makes out now because you calculated as the bulk of the maths for you. So Yeah, we want method one. Okay, so method two will look more. Okay. So example five of them. So standard treatment for a particular disease has a two fifth probability of success. Certain doctor has undertaken research in this area, has produced a new drug which has been successful with eleven out of 20 patients that claims the new drug represents an improvement on the standard treatment tests, the 5% level of significance, a claim made by the doctor. So this is. First look at a full question. So that's what that's the wording of a full hypothesis test question. That's that last sentence is how they always get it's always test at the one f percent significance level, the claim made by the person in the question. So how are we going to start this off? We state the test statistic. Yeah, Yeah. So what's x? Let's call it x. What is it? I scroll it up a little bit. Is that okay? Yeah, go for it. Yeah. So that's what I did a minute again. The the test statistic is. Number of people. Sorry. Yeah Oh okay. So Yeah, the number of patients are successful, always the number of successes. Yeah okay. So in this case, a success is a patient gets treated successfully by the disease of the disease. Okay, youhave to write it a full sentence just like that. Just like Yeah x is the number of patients where the drug succeeds is plenty. Okay? They've also defined out what p is. So again, another, you don't need to, but if you don't do it, the maths isn't technically complete. Okay. So I would always start with x number of people job worked on. And then pay probability. Of success. It's always probability of success. On a person a, they don't call it person, still hold of the column. Patient found it okay. What comes after that? To ignore what they've done because I wouldn't necessarily do it in their order. Now hypothesis, Yeah good at H not ught and H1H not ught. What's H not. 0.4Yeah good where's the zero four come from? The two Oh Yeah two fifths and the H1. And H1 is. Hmm. A ability bigger than zero point. Good. Yeah. Why? Why? So I can say that's what it is down here, but why is it that Oh well, minute, let me write the floor because doctor claims that the neujob represents an improvement. George. Yeah. So so Yeah, the the only bit we get there is the Oh, is improvement. Improvement tells us that he thinks it's over 40 chance of curing the brilliant. Okay, now what do we do. Now we work out the probability. Good. Yeah. So before you do that, this is sort of where they write this. But here I would then write, assume H no is okay because for that fundthe probability we do it under the null hypothesis. So we're assuming this is then I'll write, Oh, that means x is binomial or sign number of trials. 20 good. And then our success is whatever p is or p so point four, still okay. Then jump into the probability. So what are we trying to work out the probability of. Probability of a patient of like wasorry patient getting sorry probability of x is than sorry. Sorry, I'm bigger than eleven, bigger than eleven. Nearly. We also want. To look at eleven, you Yeah good. So that therealways be a greater or equal to or a less than or equal to because it's always the value observed in the question or more extreme. Lovely. Okay, now it comes into our binomial distribution skills. How do I work out the probability of vine. One minus good x smaller than eleven good. Can I calculate or do x is less than eleven? Yeah that Oh sorry that we let me just so we need it to be a less than or equal to so itbe less than or equal to what value? Less or equal to ten. Yeah, good. Less than equal ten and then less than or equal ten we can work out. Okay. Oh gosh sorry, one sec. Okay. Yeah, it's just I forgot to Press something. 0.128. Yeah, good. 0.128. Okay, now one and then we level. Brilliant. So is it greater than or less than the significance level? It's greater than the is Yeah, good. It's over there. Was it 5%? Yeah. So what does that mean? Yeah, good. As insufficient. Evidence. To rejectaand. Then last bit put in context to the question. To reject that. I has been there is an improvement on the on the success of the new drugs. So is there an improvement? There is there might be, but there isn't. Oh, my gosh. But there's insufficient evidence to prove that there has been an improvement on the new drug. Insufficient evidence. My Heyes looks awful on here. Insufficient evidence. To suggest this new drug is better. Brilliant. Yeah, exactly that. Okay. And that is a sort of a standard binomial distribution hypothesis test question. That's how theylook and this is exactly how you want to lay out. So to find your x and your p, get you two hypotheses. Assume H note is work out right down your distribution, work out the probability of the thing in the question happening, or more extreme, very significlevel conclude and then conclude in the context of the question. So if we look at that conclusion they went with, there's not enough evidence to suggest that evidence to reject H, the new drug is no better than the alone. But it's strange because I bet in your head, because I always did, whenever I look at this question with students, eleven out of 20 is loads. But everyis really surprised by this. Looks they go eleven out of 20, 40%. Yeah, that's an improvement. Eleven is way over eight, but actually eleven out of 20 or more extreme still happens over 12% of the time, nearly 13% of the time. So it's not actually that unlikely to happen even though it sounds sounds quite quite good. He will get some of these questions like this that there's one later on with a coin flip that is really, really surprising as well actually. But you actually need quite a high number. So it's like the number of patients it needs to be quite high before you go actually Yeah now we can say the new drugs better. Okay. So it won't this is one of those questions where it won't always the answer won't always agree with what your mind thinks the answer should be okay, he's going into this one, but it looks like our fast improvement eleven s worth, right? I'd say the New York is better, but mathematically it actually isn't, not not at the 5% level of significance anyway. Okay. Method two uses Crystal regions. Essentially, what they do is they work backwards so they find what value is below 5% and then they go, okay, that value, apparently it's 30, so you need 13 or more successes to fall into that critical region. And then they compare 13 to the eleven and then do that conclusion the same. So after we've looked at crirosions, we can do it that way if we want, if you prefer that way. However, I think it's more long winded to get to that. So I don't think it's worth the hassle personally. Yeah. Look, there are some questions though that we'll ask you to find critical regions. So we need to know it, but I wouldn't use as a default method. Okay. And then that that's it to hypothesis test sted for one child really like that's that's a very very standard question where they they give you it's up. So in an exam it will always be in the context of a question. I know we add some up here. Where where's it going? Like question two where there's no there's no context, they don't really do that in an actual exam, therealways be some scenario they throw at you. So that is actually that could be in an as applied paper, okay? Which is why there's only the one example because they're all like that. If you ever read of any of them, I need the last, actually the first you actually look, I'm gonna just scroll down to my lego. So all of these ones, the first six are all no context. So beyond that, the seven omwards are the ones you're going to see an actual exam. But personally, I don't think the context makes it that much harder. Finding the alternative hypothesis is always nasty. And the conclusion is the bit that historically our found students struggle with and they struggle to understand what so so most people that I've tutored when they get to hypothesis, test, question or taught, they're quite good at getting the first mark. The you know it's a learned skill. You can identify the word for the alternative hypothesis. Not too bad. Mark two is actual maths. This one varies. It all depends on how good students are with by omdistribution. So if you find that you not getting that mark, it's because your bonomdistribution is not as good as it could be. Then the conclusion mark is where lots of students then just dropped off, loads of students will get one or two and then just move on. Because to get the conclusion Marks, you've got to actually understand how a hypothesis test works and the reason behind it, which is why we spent ages or last lesson at the start of this lesson, just talking through the process. So if you don't understand that process, the last two Marks are really, really hard to get. And it's all about that conclusion and particularly the put into the context. This last line is the bit that people just, they just can't do. Sometimes they go, Oh Yeah, it's over the significance level, insufficient evidence rejectation, and they get that. And then they either forget or don't understand us. But it's the context. Okay, so we should we do a few of these or a couple of these? Yeah sure. Yeah which one's taking you fancy? Three to ten because it had a straight of ten Yeah. There we go. Polland organization claims to support particular candidate, 35% candidate will pledge to support local charities if elected, call an organisation, think that the level of support will go up as a result. Takes a new poll of 50 voters. Okay, so tastastic null hypothesis or native hypothesis? So what they. The test statistic is that. But more than. Yeah more than 35% of them was sorry. So the not the tastastic. It's always the number of successes. So what what are they classisted as as a success in this? But there is support for a particular country. Brilliant. Yeah. So test statistic is the number of people who support candidate. Okay? So there's no there's no actual numbers involved, but it's just the number of people who do the thing we're interested in. Okay, then support the kind yes, so number of people who support the candidate. Oh, I know why I always get suffconfused for the first. It's weird on that, like. So annoyingly, sometimes this isn't worth a mark. Sometimes it's sort of lumped in with the like the hypothesis mark. It's always good to write. Because it's so it's mathematically correct. You need it there. You need to find what x is and you need to find p, otherwise your question doesn't actually make any sense. Yeah brilliant testistabecause that and we call that x normally. And then what about the null and alternative hypothesis? Yeah brilliant. And then H1. Yeah brilliant. Okay. The word you got from that is they reckon it support would go on. So Yeah you don't get much to go on but well then you've identified it. Part b is then do the hypothesis test so using the 5% level of significance. Oh, they want the critical region and only but we'll change the question. We're not going to find the critical region. We're going to we're going to do the we'll just do the hypothesis test because we haven't done critical regions yet. We'll go with we'll look ahead at c and they said 28. So we'll assume 28 people and we'll do the hypothesis test on that. Good yet 28 or more extreme. Wait, don't write x is. P would be that Oh sorry, or this thing I'd just put equals here actually. No looking for me table. What we got 50. Do I agree with your number? Yeah, I don't want to. Yeah. Okay. Yes, that's less than 5%. So what? Sufficient evidence? Good. Yeah Yeah. Move that. What's it called be. So for the first mark, I would always talk in terms of H1 or H naught to either say there's sufficient evidence to accept H zero or there's sufficient evidence to reject sufficient evidence to reject H zero ught or support H1. Okay. So either accept or reject depending on which doesn't really matter which one you go for. Yeah, if that worked. And then for that last bit, then put it into the context of the question. So what does that actually mean in terms of the polling organization? Developing. More support for the country. Yes. And there's evidence to suggest the supporgot up here. Good Yeah brilliant in the textbook there Marks for the conclusion they've got there is evidence that the candidate's level of popularity has increased. If you're ever struggling for that, put it in the context use the language that they use in the question. Yeah so just just look at what they've gone and go okay, so there is evidence to suggest the level of sport will go up or has gone up okay. So that again Yeah that I don't we the one question that wanted on a critical agent but that's fine we can amend that but Yeah that again pretty standard question they don't break it down like this though in an actual question they usually just throw the whole thing at you in one go okay but you still do this put this step so like what we did even if it just said test at the 5% level of significance, the 28 people that youstill do this exact thing anyway. Yeah. Okay. So nothing really changes here. The next bit is two tel e tests. So so far we've only looked at one tel tests. So it's always been H1. It's always been a gross than or a less than Yeah two tab test. What happens is, so let's jump straight into the example and then we'll come back to this bit here. What happens is the person who has a new hypothesis won't say they think it's gone up or down theysay in this question. It's that word there. Okay. So they're not sure which way it is. Okay, so so who's doing is it manuual or is it I don't rather full question. Over a long period of time, we found that Enrica's restaurant ratio of none veggie to veggies, two to one manuels restaurant, random sample of ten people orin one older veggie, five set level certiance test whether or not okay, so no one in particular is, but if they're not saying whether they think it's gone up or it's gone down, they say it's different. It's what we call a two titest. What happens is H1 just has a not equal to. Okay, so that's the first change thatbe the most obvious one. And then the second thing that changes is the significance levels in the question. They've set a 5% level of significance, but because it's a two taltest, each tail gets half of the significance level. Oh, what don't I do that? So when they're comparing the answer they get with the significance level, they've halved it, okay, as this behears us, so instead of it being 0.05, they've have half 0.05. Okay, and that is the only difference. Okay, so there's two differences. Your alternative hypothesis will have a not equal to and you half the significance level. That's it. Everything else stays exactly the same. The thing is. Cool. But then how do we know the. Like. Like let's say normally after the x bit like after the H zero bit, then we'll do it right for H1. So it's always so p and the number don't change. It's just a not equal to sort of a less than a row greater than does literally just cross an equal le sign up. But then what do we put in the calculatoh? I see it mean it mean so map doesn't change actually. So for this question, I keep scrolling too much. There we go. So this question, x is the number of people in the sample who order veg. So it's the number of successes in our sample. P ys, the probability that a randomly chosen person orders a veggie male significlevel and all that assume H note is so we get this spunolar distributed. We've got ten people because there's ten of them with the probability of success of one third. The thing we work out, again, it's always the chance of the thing that's observed, which was one or more extreme. Okay, in this case more extreme of one is less than or equal to one because we want to go away from the expected value. If we're expecting a third of people to order veg then three and a bit people out of ten should order of vegie. So we're going away from that three. Okay? The reason we half the significance level is because until we get that observed value, we don't know which side of the apopothesis test we're looking at. So they've gone. I think it's different. I don't know which way I think it's different. We're testing the 5% level of significance. So two and a half percent covers whether it's decreased and 2.5% covers whether it's increased. And then based on the sample, tells us which end we're actually looking at. Okay. So in the sample, only one person orveggie, because that's on the lower end. We only look at the lower end, where there's a 2.5% level of significance. If instead of one person, let's say it was eight, people would then look at the upper end and we go, okay, well, it's still 2.5%, but we're looking at the upper tail. So wehave greater than or equal to eight is that thing we were trying to find. Okay, so this bit doesn't actually change. It's always the value observed or more extreme. Okay, this this looks dodgy. Don't don't do that. Just type in new calcua. Don't do that. That's duschool. We would just do jump straight to that and I calcula okay, and then your conclusion will be a bit different. You always say it's decreased, you'll just say it's different, same as they did. Okay. I want to now take it back to this bill pijust, so we could sort of look at the theory a bit more. So one tail test is used when the claim says it's gone up or gone down. Two tais used when they don't state a direction of change or they're not sure. So sometimes itbe, phrased as manuual, thinks it is different, but it's not sure how it's changed. Okay, therebe some sort of uncertainty or theyjust go different. And there Yeah this bit here. So we need to know which tail of the distribution we're testing. You don't actually test both of them. You're only end up testing in one. So we're looking at the expected outcomes. We looked at this already actually where we take the number of trials times about the probability with the simpler questions. You were doing this just automatically. So we looked at the coin flip one and we had eight tosses. You went well, four heads because it's half of them. Okay? So the actual mathematical way you get that expected outcome, the number of trials, tiprobability. So in in ricose case, we've got ten people in our sample, but we didn't talk about that actually that they they gave the probrobertive success is a ratio because that's where the third came from because it's one out of the three. Okay? So wedo ten times third get 3.3 recurring. So that's our expected. I obviously kind of have 0.3 of a person order dering a veggie meal, but thin a bit and then we're always testing. Or more so I like to say all more extreme that they've spted out in a bit longer ways. As they said, if the abserved d value x is lower than this expected outcome, you test x is less than or equal to x. If it's higher than the expected value, then we test greater than equal to that value. Okay. It's normally, as they say, Hey, it's normally obvious which way you should test. You don't often get one that's really, really close to that expected value. Like if the expected outcome is 3.3, they probably won't give you three or four. If it's if it's one that's less, it's going to be one. If it's one or if it's one that's higher, you're looking at easily six or above. So it's usually quite obvious which way to go, okay, as say that's all the changes in the two totest. Okay. So your null no, not your null. Your alternative hypothesis will have a not equal to in it. Everything else stays the same. So so the actual maths doesn't change here. I've rupped it all out. I've croit all out. Let's actually pretend to see it. So the actual maths stays the same. And then when you compare with each significance level, you half the one in the question. Okay, they're the only two changes. Conclusion still the same. I suppose you can say the conclusion as the word different n, but you just use the language that they're using in the question. Yeah method two again, they've done it with critical regions. So with two totest, unless they ask you to, I think this is really silly to do because they had to find two critical regions. So there go that they've said that you have to be at zero or seven are above to be critical and then go ark. So one is not in the critical ystal region. So Yeah, but again, I wouldn't do this method unless they specifically tell you you have to. Okay, okay. And then again, it's just more questions like that. The first I use the first five. Yeah there's no no context to them. So well, well, they're not bad. I personally wouldn't I wouldn't even bother looking at them even even if you were looking to do loads of questions. I would jump straight at six and do six to ten because they're the ones that are more exam based and more realistic to what you're going to see. Okay, one. Yeah. Which one? Which one? You can't say. Do you have scroll? You pick one and we'll try and pick one. This time we'll look ahead at the question. We'll pick one that doesn't mention a critical region. So wehave to do it that way. We can just into the last one Yeah with your australito. How let me minimize that. Okay. So question term, stblood test is able to didise a particular disease with probability point 96. Manufacturers suggest that a cheap test will have the same probability of success. Conducts a clinical trial on 75 patients. The new test correctly diagnosed at 63 of these patients. Test the manufacturer's claim at 10% level to state ating your hypotheses clearly get. Over there first state the state the test statistic. Good idea x so what is x? So you can just call that x. Yeah, it is the testistatistic, but you can just call x just to minimize how much you've got to write. Number of successes made by the cheaper test? Yes. The number of patients correctly diagnosed? Yes. The number of successes. Good. By the cheaper test you could start made but Yeah Yeah made by the cheaper test because that's what we're testing. Lovely. What's. That's define pay as well. I was grfully for it. But what actually is it? It's the probability of what happened. Zero point so not not what numerical value is it what what actually is it measuring it's the probability of what happening Oh sorry of. In making a right diagnosis. Yeah, exactly. Here p is the probability of a successful diagnosis. So not said not Yeah but it feels weird this one because it's really, really high probability. Yeah and then alternative. Just p not equal to Yeah so it should have been obvious it was not equal to because we're in the two section. However, the language in the question is it's just the fact that they think it should have the same that tells you it's it's that so you are right. You know you are right. It is that but it should like you should have known it was that based on the fact that all of it, like we're in the two section, select that they're all detailed iled. So in an actual exam, you've got to identify that when it says they think the cheaper test will have the same probability, that means this in general, if you can't identify them saying, I think it's gone up or I think it's gone down, it will be a. Okay, now what? Yeah, good. So we say smaller equal Yeah 60Yeah that's to 60. Yeah relax. We want more extreme. You want to include the 63. You need to include the value in the questions. You need to include 63. Not that I don't imagine it makes much difference to the question, but Yeah, there you go and then you work that out. But the thing is, what if I include 63? Wouldn't that mean it's the same probability I 62? So less than to 63 and less than a to 62 will be slightly upfront. There won't be a massive difference, but there will be a little difference. But the thing is if I include 63, doesn't that mean that it has a chance of it being the same because it might be 63? So they've tested so in their sample they had 63 so we always test the thing they got in the sample or more extreme. Well, okay, so need you need the 63 as well because that's the thing they measured it to be. I don't think my table goes up that high, so I might not be able to match you with a value here. No, my table stops at n is 50. Oh, I got the answers somewhere. I have got the answers. That's fine. I can just cheat and do it that way. Wait. I think my calculator isn't doing that as well. Not doing that. What's it saying? The full screen here while you entered is not allowed in the function or command. What did you type? They type. You should try again because your calculator should do it because it's one of the graphical the normal calculator should do it. So you should do it as well. Sorry I forgot to put the 0.96I just put okay, okay. So you put 96. Yeah and that's why I didn't like it. It's a really, really small value ley though. That's tidy. Yeah. Did you get 0.0000417? Yeah, I just got it in like that's fine. Yeah, put it on the phone. That's fine. Yeah, that works. Yeah, tiny. Okay. And then what significance level do we test that against? Zero point nearly it's too. Wait, sorry, can I scroll back? Oh sorry, I'm 0.005 good. Yeah half the so it was 10% but half it because it's too. Good to it's below that. So what? To support Oh, sorry, I know I. There is sufficient evidence to support. To support that. It will have the same. It will Yeah, it will have the same probability of success, but are we supporting the null hypothesis or the aunative hypothesis? So you want you want your conclusion to be two step. So the first step is, is it for or against the old system? So is there sufficient so you said sufficient evidence to support. So which one does this support? Does this value support H1 or H zero? So. H1Yeah because if H zero is this is the chance of it happening under H zero, which is basically zero. That's as low as it can get really. So there's evidence for H1. Good. So that gets you your one on your Marks. So to get your last mark, put it into the context of the question. So what does that tell us about the new test? New test does not have a probability of 0.96 exactly that. Yeah. So the new test does not have the same probability of success as the old test. Or there's technically, there's evidence suggeit doesn't have the same. However, at that sort of probability, we can all but guarantee it. Yeah, brilliant. To it. There you go. That then is the end of hypothesis test. And other than the critical region bit, I shouldn't quite get onto our critical region bit. It's an alternative method. But also sometimes the question will ask you to find a critical region. So it is worth looking at. And if we had time now we're look at it now, but we don't unfortunately. I spoke to sort from the tutoring company yesterday about rearranging. I've not there, haven't given me a day yet. I don't know Denif I said anything to you. I know, I know in the original thing, I don't think you're back at school till next week. I Yeah because the asif is okay on the tenth and eleven, but I'm traveling the tenth to Beijing and on the eleven theybe on the play. So I if we can do it earlier, like maybe before the tenth, which of all that is I'm back at school Monday and then you're, I think you're eight hours ahead of me, so I can't do it before school and after school. The earliest I cle do is like 4:00, which for you is like midnight. How about after eleven the weekend after eleven? Because so with the weekends, I can only really do as in a different weekend. Yeah a different weekend potentially. I'll have to look at what my calendar is, but Yeah, I can have a look at that. Yeah. Okay. Yeah. Well, today that covered a lot of math again there. You know if you're doing this in school, pop off this test is it's it's a big topic and we've covered it effectively in like under two hours. So waldone, and hopefully I'll see you soon. Yeah, bye in a bit water.
处理时间: 29151 秒 | 字符数: 38,247
AI分析 完成
分析结果 (可编辑,支持美化与着色)
{
    "header_icon": "fas fa-crown",
    "course_title_en": "Language Course Summary",
    "course_title_cn": "语言课程总结",
    "course_subtitle_en": "Statistics Review: Hypothesis Testing (Binomial Distribution)",
    "course_subtitle_cn": "统计学回顾:假设检验(二项分布)",
    "course_name_en": "Matt Alice Lesson",
    "course_name_cn": "Matt Alice 课程",
    "course_topic_en": "Hypothesis Testing (One-tail and Two-tail Tests)",
    "course_topic_cn": "假设检验(单尾和双尾检验)",
    "course_date_en": "Date not specified",
    "course_date_cn": "日期未指定",
    "student_name": "Alice",
    "teaching_focus_en": "Reviewing and practicing the step-by-step process of hypothesis testing using binomial distributions, focusing on identifying the test statistic, hypotheses, calculating probabilities, and forming context-specific conclusions for both one-tail and two-tail tests.",
    "teaching_focus_cn": "回顾并练习使用二项分布进行假设检验的逐步过程,重点关注识别检验统计量、假设、计算概率以及为单尾和双尾检验形成特定情境的结论。",
    "teaching_objectives": [
        {
            "en": "Solidify understanding of the components of a hypothesis test (Test Statistic, H0, H1).",
            "cn": "巩固对假设检验组成部分(检验统计量、H0、H1)的理解。"
        },
        {
            "en": "Accurately set up one-tail and two-tail hypotheses based on contextual clues.",
            "cn": "根据情境线索准确设置单尾和双尾假设。"
        },
        {
            "en": "Correctly calculate the required probability using the binomial distribution (Binomial PD\/CD functions).",
            "cn": "使用二项分布(Binomial PD\/CD 函数)正确计算所需的概率。"
        },
        {
            "en": "Formulate a contextual conclusion by comparing the calculated probability with the significance level.",
            "cn": "通过将计算出的概率与显著性水平进行比较,形成情境化的结论。"
        }
    ],
    "timeline_activities": [
        {
            "time": "Start - Mid",
            "title_en": "Review Test Statistic and Hypotheses (Example 1)",
            "title_cn": "检验统计量与假设回顾(例1)",
            "description_en": "Discussed the definition of the test statistic (number of successes) and set up H0 and H1 for a one-tail test, identifying clues like 'overestimating' to determine direction (p < 0.4).",
            "description_cn": "讨论了检验统计量的定义(成功次数)并设置了单尾检验的H0和H1,通过‘过高估计’等线索确定方向(p < 0.4)。"
        },
        {
            "time": "Mid",
            "title_en": "Hypothesis Rejection Condition and Binomial Setup",
            "title_cn": "假设拒绝条件与二项分布设置",
            "description_en": "Explained the condition for rejecting H0 (p-value < significance level) and set up the binomial distribution parameters (n=20, p=0.4), determining the region of extremity (X ≤ 2).",
            "description_cn": "解释了拒绝H0的条件(p值 < 显著性水平),并设置了二项分布参数(n=20, p=0.4),确定了极端区域(X ≤ 2)。"
        },
        {
            "time": "Mid - Late",
            "title_en": "Practice One-Tail Test (Example 5)",
            "title_cn": "单尾检验练习(例5)",
            "description_en": "Worked through a complete one-tail test problem (drug improvement). Student correctly identified X, P, H0, H1 (p > 0.4), calculated P(X ≥ 11), compared with 5%, and concluded appropriately.",
            "description_cn": "完成了一个完整的单尾检验问题(新药改进)。学生正确识别了X, P, H0, H1(p > 0.4),计算了P(X ≥ 11),与5%比较,并得出恰当结论。"
        },
        {
            "time": "Late",
            "title_en": "Introduction to Two-Tail Tests (Example 10 & Theory)",
            "title_cn": "双尾检验介绍(例10与理论)",
            "description_en": "Introduced two-tail tests where H1 uses '≠' and the significance level must be halved (e.g., 5% becomes 2.5% for comparison). Practiced setting up a two-tail test (Example 10) and calculating the probability of the observed value or more extreme.",
            "description_cn": "介绍了使用'≠'的双尾检验,以及显著性水平必须减半(例如,5%变为2.5%进行比较)。练习设置了一个双尾检验(例10)并计算了观察值或更极端值的概率。"
        }
    ],
    "vocabulary_en": "Test statistic, Null hypothesis (H0), Alternative hypothesis (H1), Significance level (α), One-tail test, Two-tail test, Success, Expected value, Critical region, Overestimate, Insufficient evidence.",
    "vocabulary_cn": "检验统计量, 零假设 (H0), 备择假设 (H1), 显著性水平 (α), 单尾检验, 双尾检验, 成功, 期望值, 临界区域, 过高估计, 证据不足.",
    "concepts_en": "Binomial Distribution Application in Hypothesis Testing, Directionality in H1, Halving significance level for two-tail tests, Conclusion contextualization.",
    "concepts_cn": "二项分布在假设检验中的应用, H1 中的方向性, 双尾检验显著性水平减半, 结论语境化.",
    "skills_practiced_en": "Interpreting contextual clues to define H1 direction, setting up binomial distributions under H0 assumption, using calculator functions (Binomial PD\/CD), writing formal statistical conclusions.",
    "skills_practiced_cn": "解读情境线索以定义H1方向, 在H0假设下设置二项分布, 使用计算器功能(Binomial PD\/CD), 书写正式的统计结论。",
    "teaching_resources": [
        {
            "en": "Worked examples from textbook exercises (focus on contextual problems).",
            "cn": "教科书习题中的范例(重点关注情境题)。"
        }
    ],
    "participation_assessment": [
        {
            "en": "Very high engagement, actively recalling previous concepts and participating in setting up steps for new examples.",
            "cn": "参与度非常高,积极回忆先前概念并参与设置新例题的步骤。"
        }
    ],
    "comprehension_assessment": [
        {
            "en": "Strong understanding of the overall hypothesis testing framework, particularly how to determine extremity based on the expected value in one-tail tests.",
            "cn": "对假设检验的整体框架理解深刻,特别是单尾检验中如何根据期望值确定极端方向。"
        }
    ],
    "oral_assessment": [
        {
            "en": "Fluency in explaining the reasoning behind hypothesis choices and conclusion structure, although occasional hesitation when shifting between 'less than' and 'more extreme'.",
            "cn": "解释假设选择和结论结构的推理很流利,但在切换‘小于’和‘更极端’时偶尔有停顿。"
        }
    ],
    "written_assessment_en": "Student successfully applied learned procedures to set up and solve complex one-tail and two-tail binomial tests.",
    "written_assessment_cn": "学生成功应用所学程序来设置和解决复杂的一尾和双尾二项式检验问题。",
    "student_strengths": [
        {
            "en": "Quickly grasps the structure of the hypothesis test and the marking scheme logic.",
            "cn": "能快速掌握假设检验的结构和评分方案的逻辑。"
        },
        {
            "en": "Accurately identifies the 'success' definition for the test statistic (X).",
            "cn": "能准确识别检验统计量(X)的‘成功’定义。"
        },
        {
            "en": "Understands the need to halve the significance level in two-tail tests.",
            "cn": "理解在双尾检验中需要将显著性水平减半。"
        }
    ],
    "improvement_areas": [
        {
            "en": "Ensuring the conclusion strictly adheres to the context using the language provided in the question, especially when rejecting H0.",
            "cn": "确保结论严格遵循问题的上下文语言,特别是在拒绝H0时。"
        },
        {
            "en": "Rigorously including the observed sample value (x) in the 'more extreme' range (e.g., P(X ≤ x) instead of P(X < x) when necessary).",
            "cn": "严格确保观察到的样本值(x)包含在‘更极端’的范围内(例如,在需要时使用 P(X ≤ x) 而不是 P(X < x))。"
        }
    ],
    "teaching_effectiveness": [
        {
            "en": "The teacher effectively guided the student through complex contextual problems by breaking them down into manageable steps (Model -> Setup -> Calculation -> Conclusion).",
            "cn": "教师通过将复杂的情境问题分解为可管理的步骤(模型 -> 设置 -> 计算 -> 结论)有效地引导了学生。"
        },
        {
            "en": "Strong clarification provided on the subtle differences between one-tail and two-tail tests, particularly regarding the alternative hypothesis and significance level adjustment.",
            "cn": "对单尾和双尾检验之间的细微差别提供了清晰的解释,特别是关于备择假设和显著性水平的调整。"
        }
    ],
    "pace_management": [
        {
            "en": "The pace was fast but manageable, covering both one-tail and two-tail procedures in depth due to the student's solid foundational knowledge.",
            "cn": "由于学生扎实的基础知识,课程节奏快但可控,深入涵盖了单尾和双尾过程。"
        }
    ],
    "classroom_atmosphere_en": "Interactive, analytical, and focused, with the teacher ensuring the student understood the 'why' behind the mathematical steps, especially the logical shift in conclusions.",
    "classroom_atmosphere_cn": "互动、分析性强且专注,教师确保学生理解数学步骤背后的‘原因’,特别是结论中的逻辑转变。",
    "objective_achievement": [
        {
            "en": "All primary objectives relating to hypothesis setup, calculation, and conclusion formation were met through guided practice.",
            "cn": "所有与假设设置、计算和结论形成相关的首要目标都通过指导练习得以实现。"
        }
    ],
    "teaching_strengths": {
        "identified_strengths": [
            {
                "en": "Excellent use of contextual examples to illustrate abstract statistical concepts.",
                "cn": "出色地利用情境范例来说明抽象的统计概念。"
            },
            {
                "en": "Detailed breakdown of the four key marks in an exam-style conclusion.",
                "cn": "详细分解了考试风格结论中的四个关键分数点。"
            }
        ],
        "effective_methods": [
            {
                "en": "Systematically comparing the observed sample result against the expected value to determine the tail\/direction of testing.",
                "cn": "系统地将观察到的样本结果与期望值进行比较,以确定检验的尾部\/方向。"
            },
            {
                "en": "Explicitly contrasting one-tail vs. two-tail tests in terms of H1 and α adjustment.",
                "cn": "明确对比单尾与双尾检验在H1和α调整方面的差异。"
            }
        ],
        "positive_feedback": [
            {
                "en": "The student demonstrated high retention of the overall testing procedure after intense review.",
                "cn": "学生在密集的复习后,对整体检验程序表现出很高的掌握度。"
            }
        ]
    },
    "specific_suggestions": [
        {
            "icon": "fas fa-poll",
            "category_en": "Hypothesis & Conclusion Structure",
            "category_cn": "假设与结论结构",
            "suggestions": [
                {
                    "en": "Always write out the full context for the conclusion. E.g., 'There is sufficient evidence to reject H0, suggesting the new drug *is* better than the standard treatment (p > 0.4).'",
                    "cn": "始终写出结论的完整语境。例如:‘有充分证据拒绝H0,表明新药*确实*优于标准治疗(p > 0.4)。’"
                },
                {
                    "en": "When setting H1 for one-tail tests, explicitly state *why* you chose '>' or '<' based on the wording (e.g., 'improvement' means '>').",
                    "cn": "在设置单尾检验的H1时,明确说明选择‘>’还是‘<’是基于问题的措辞(例如,‘改进’意味着‘>’)。"
                }
            ]
        },
        {
            "icon": "fas fa-calculator",
            "category_en": "Calculation Precision",
            "category_cn": "计算精确性",
            "suggestions": [
                {
                    "en": "In binomial probability calculations, be extremely careful to include the observed value (x) in the region of extremity (e.g., use P(X ≤ 11) or P(X ≥ 11) as required).",
                    "cn": "在二项分布概率计算中,务必将观察值(x)包含在极端区域内(例如,根据要求使用 P(X ≤ 11) 或 P(X ≥ 11))。"
                }
            ]
        }
    ],
    "next_focus": [
        {
            "en": "Formal introduction to Critical Regions as an alternative method for hypothesis testing, specifically how to find and use them for both one-tail and two-tail tests.",
            "cn": "正式介绍临界区域作为假设检验的替代方法,特别是如何为单尾和双尾检验找到并使用它们。"
        }
    ],
    "homework_resources": [
        {
            "en": "Review the structural differences between one-tail (H1: ≠) and two-tail (H1: =\/≠) tests, paying close attention to the significance level adjustment for two-tail tests.",
            "cn": "复习单尾检验(H1: ≠)和双尾检验(H1: =\/≠)之间的结构差异,密切关注双尾检验的显著性水平调整。"
        }
    ]
}
处理时间: 9 秒
HTML报告 完成

生成时间: 2026-02-05 09:41:45

查看报告 下载报告
返回列表