Why is p-value greater than 1 when it should not

**gko_87** · 10-11-2017, 04:46 PM

After doing some searching online, I realized that p values should not be greater than 1. Somebody please tell me what I am doing wrong here:

Final Template2.xlsx

**joeu2004** · 10-11-2017, 04:57 PM

Why are you multiplying T.DIST by 2?!

I wonder if you want:

=2*(1 - T.DIST($O6,$N6,TRUE))

which is equivalent to

=T.DIST.2T($O6,$N6)

**gko_87** · 10-11-2017, 05:07 PM

Kindly refer to problem 2 here:

http://stattrek.com/hypothesis-test/...px?Tutorial=AP

Change null and alternative from:

Null hypothesis: μ1 - μ2 >= 7
Alternative hypothesis: μ1 - μ2 < 7

To:

Null hypothesis: μ1 - μ2 = 0
Alternative hypothesis: μ1 - μ2 not equal

**joeu2004** · 10-11-2017, 05:53 PM

[.... deleted by me ....]

**joeu2004** · 10-11-2017, 06:04 PM

[.... deleted by me ....]

**MrShorty** · 10-11-2017, 06:17 PM

As near as I can tell, the calculation programmed into the spreadsheet is correct. I put those parameters (t=2.24 and df=145) into stattrek's t-distribution calculator (link in tutorial page) and into this calculator http://www.danielsoper.com/statcalc/...tor.aspx?id=41 and got the same answer as your function did. So this appears to be the correct result for the cumulative distribution function. This table of cumulative probabilities (http://www.sjsu.edu/faculty/gerstman...er/t-table.pdf ) shows many entries that are greater than 1. Are you certain that your calculation must be less than 1?

I have not thought through this fully, but is it possible that you intended to compute the probability density function (3rd argument of T.DIST() function is 0 or FALSE)? The probability density function will return values much closer to 0 (and probably cannot return a value greater than 1).

It has been a long time since I looked at t-tests, so I don't remember the details very well. I would suggest that you need to check your procedure and make sure you are clear on what the p value represents.

**joeu2004** · 10-11-2017, 06:20 PM

[.... deleted by me ....]

**joeu2004** · 10-11-2017, 06:27 PM

Originally Posted by joeu2004

[.... deleted by me ....]

Sorry for the misdirections. I got caught up in the tutorial examples, and I lost sight of your original question.

Bottom line: It does appear that you are doing the correct calculations.

Off-hand, I cannot explain why the p-value exceeds 1 in this example. I suspect it is due to a misinterpretation of the problem or the structure of a solution for two-tailed problem. But honestly, it has been too long since I did hypothesis testing, and I do not have time to delve into this further.

**gko_87** · 10-11-2017, 06:27 PM

P values should not be greater than 1. They will mean probabilities greater than 100 percent.

**gko_87** · 10-11-2017, 06:33 PM

Consider for example the P value in my attachment:1.973

If multiplied by 100 to convert to percentage you get 197.3%.

Is this realistic?

**gko_87** · 10-11-2017, 06:37 PM

According to explanation here the p shouldn't be greater than 1:

https://socratic.org/questions/can-a...why-or-why-not

**gko_87** · 10-11-2017, 07:02 PM

How can I perform the 3 t tests with the following as the alternative hypotheses:

1st - test for the equality of the means

2nd - if mean 1 is greater than mean 2

3rd - if mean one is less than mean two.

**joeu2004** · 10-11-2017, 09:02 PM

Originally Posted by Onditi

Consider for example the P value in my attachment:1.973 [....] Is this realistic?

No.

Aha! Part of the problem is: the old TDIST function, which I am used to, is not parameter-for-parameter compatible with the new T.DIST function.

Since you want to test u1-u2=0, you should use a two-tailed test. Enter one of the following formulas into P6:

=T.DIST.2T(ABS(O6),N6)

or

=TDIST(ABS(O6),N6,2)

or

=2*TDIST(ABS(O6),N6,1)

or

=2*T.DIST(-ABS(O6),N6,TRUE)

I can explain further later, if you wish. I have to leave now.

**joeu2004** · 10-12-2017, 03:44 AM

Originally Posted by joeu2004

Part of the problem is: the old TDIST function, which I am used to, is not parameter-for-parameter compatible with the new T.DIST function.

And the various descriptions of Student's t are inconsistent, not only among themselves, but also within themselves.

For example, consider the stattrek.com tutorial. On the one hand, it says that the Student's t-statistic is calculated by the formula t = [ (x1 - x2) - d ] / SE = -1.99. On the other hand, it says that the p-value is interpreted as P(t < -1.99) = 0.027; that is, the probability that t < -1.99. In the latter context, t refers to a random variable, not the Student's t-statistic (which is -1.99).

The stattrek.com Student's t calculator uses the correct nomenclature, namely: P(T < t). T is the random variable that has a Student's t-distribution with df degrees of freedom, and t is the Student's t-score for the sample data.

Originally Posted by joeu2004

Since you want to test u1-u2=0, you should use a two-tailed test. Enter one of the following formulas into P6:
=T.DIST.2T(ABS(O6),N6)
or
=TDIST(ABS(O6),N6,2)
or
=2*TDIST(ABS(O6),N6,1)
or
=2*T.DIST(-ABS(O6),N6,TRUE)

Your formula 2*T.DIST(O6,N6,2) works for the tutorial problem #1 only because the t-score in O6 is negative. (And because T.DIST interprets any non-zero numerical value as TRUE in the 3rd parameter.)

T.DIST(t,df,TRUE) returns P(T < t) for t <= 0 and t > 0. But for a two-tailed test and t > 0, we want P(T < -t) + P(t < T); that is, the sum of the cumulative tail probabilities. Since the Student's t-distribution is symmetrical, P(T < -t) = P(t < T), and P(T < -t) + P(t < T) = 2*P(T < -abs(t)). So we can write 2*T.DIST(-ABS(t),df,TRUE), which is what T.DIST.2T(ABS(t),df) calculates.

TDIST(t,df,1) returns P(t < T) for t > 0. So we can write 2*TDIST(ABS(t),df,1), which is what TDIST(ABS(t),df,2) calculates.

**joeu2004** · 10-12-2017, 04:04 AM

Originally Posted by Onditi

How can I perform the 3 t tests with the following as the alternative [sic] hypotheses:
1st - test for the equality of the means
2nd - if mean 1 is greater than mean 2
3rd - if mean one is less than mean two

I think you mean: those are the null hypotheses, consistent with the stattrek.com tutorial.

u1-u2 = 0, a two-tailed test: T.DIST.T2(ABS(t),df)
u1-u2 >= d, a one-tailed test: T.DIST(t,df,TRUE)
u1-u2 < d, a one-tailed test: 1 - T.DIST(t,df,TRUE)

d = 0 for u1>=u2 and u1<u2.

As the stattrek.com tutorial explains, reject the null hypothesis when the P-value returned by the appropriate expression above is less than the significance level (typically 0.01, 0.05 or 0.10; but it can be anything between 0 and 1 non-inclusively).

**gko_87** · 10-12-2017, 04:08 AM

Thank you Joe. Will run some tests then get back here.

**gko_87** · 10-12-2017, 04:11 AM

I really appreciate you taking the time to take me through this.

**gko_87** · 10-12-2017, 06:23 AM

Would you be kind enough to display how you would state the null and alternative hypothesis for μ1 > μ2 and μ1 < μ2. Do i compute another set of degrees of freedom and t when testing for μ1 > μ2 and μ1 < μ2, or do i use the same values I used for for μ1 = μ2?

**joeu2004** · 10-12-2017, 08:41 AM

Originally Posted by Onditi

Would you be kind enough to display how you would state the null and alternative hypothesis for μ1 > μ2 and μ1 < μ2. Do i compute another set of degrees of freedom and t when testing for μ1 > μ2 and μ1 < μ2, or do i use the same values I used for for μ1 = μ2?

As I understand it (not an expert), yes: referring to the stattrek.com tutorial, SE, df and t would be the same because d is simply zero.

I provided the null hypotheses in post #15. Note that I corrected one mathematical expression.

If algebra is not your strength, here is something more complete.

Please Login or Register  to view this content.

According to one or two webpages, we do not always accept the alternate hypothesis when we reject the null hypothesis.

I am not citing the webpages because their authority and descriptions are unclear. You might do your own research, if it matters to you.

-----

The expressions for #3 assume that t is calculated the same way as for #2 (my preference).

Alternatively, calculate t as follows:

t = (d - (u1-u2)) / SE

and reject the null hypothesis when:

T.DIST(t,df,TRUE) < significance level

Why is p-value greater than 1 when it should not

LinkBack

Thread Tools

Rate This Thread

Display

Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Re: Why is p-value greater than 1 when it should not

Thread Information

Users Browsing this Thread

Similar Threads

[SOLVED] find value which is greater than in a range of cells and return the greater value

[SOLVED] count values greater 2 or greater in a column.

[SOLVED] Value must be 1 or greater

Adding Cells, if Sum is Greater than - How much greater?

Greater than value than yes

Greater than or Less than.. If Greater Need Difference

How to record greater than 50 in a cell so it reads as greater than 50

Bookmarks

Bookmarks

Posting Permissions