Tilted Forum Project Discussion Community  

Old 12-06-2007, 06:27 PM   #1 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Multivariable Logistic Regression

Anyone know how to run a multivariable logistic regression in JMP?
Or some other way to analyze my data?

I have three independent variables and one dependent. All are categorical.

Doubt anyone knows this junk, but I've seen a few scientists about the boards, figure someone might have a clue.

I'll give out more info if anyone is interested in helping.
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy
Old 12-07-2007, 04:37 AM   #2 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
I haven't the slightest idea about JMP, nor do I know the nature of the data, but I am pretty good at making programs that model data. Since you say regression, I assume you are given a list of points, each with four elements: P_i = (x_i, y_i, z_i, r_i).

So r_i ~ f(x_i, y_i, z_i). What is f(x, y, z)?

Provided f(x, y, z) isn't too complex, I can make a simple algorithm that gives you the constants of your equation.
Old 12-07-2007, 12:52 PM   #3 (permalink)
The Reverend Side Boob
 
Bear Cub's Avatar
 
Location: Nofe Curolina
Not familiar with JMP, but do you have any access to MiniTab software?
Old 12-07-2007, 01:30 PM   #4 (permalink)
Deja Moo
 
Elphaba's Avatar
 
Location: Olympic Peninsula, WA
Argh... I took that class in the '80s, and any software used then is certainly long obsolete. Thanks a bunch, gg. My post-traumatic statistics disorder has kicked in again.
__________________
"You can't ignore politics, no matter how much you'd like to." Molly Ivins - 1944-2007
Old 12-07-2007, 01:44 PM   #5 (permalink)
I Confess a Shiver
 
Plan9's Avatar
 
Man, I bet WillRavel knows something about this stuff.
__________________
Whatever you can carry.

"You should not drink... and bake."
Old 12-07-2007, 03:07 PM   #6 (permalink)
 
abaya's Avatar
 
Location: Iceland
I fucking hate statistics. I learned about this in my Regression class in 2006, but that all faded away about 2 hours after I took the final exam, unfortunately.

Sapiens is the hands-down statistical king of TFP. I would PM him if you need help.
__________________
And think not you can direct the course of Love;
for Love, if it finds you worthy, directs your course.

--Khalil Gibran
Old 12-07-2007, 03:27 PM   #7 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thanks, Augi, and others for your responses here! Unfortunately, I haven't even taken a statistics course. They threw me into this field class with this JMP program and told me to analyze my data with it. The graduate student who was helping me reached the limit of her knowledge when she realized that I did not have continuous data.

Sorry, chassiswelder, no minitab access. Just JMP.

I'm supposed to be able to find a simple equation in y = mx + b format to plug into the raster calculator of ArcGIS (mapping), to make a predictive model of the population of my sedge.

I've spoken with a couple of faculty members about the issue, and they've given me a couple of other workarounds: if/then statements to plug in, if I can figure out how to plug them into the GIS program. Unfortunately, I haven't yet gotten an e-mail response from the program's GIS expert telling me whether that's possible. I'm thinking that I'll resort to making my layers transparent, overlaying them, and making a new polygon shapefile layer highlighting every occurrence of the specific combination of abiotic factors that I'm dealing with.
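
As a rough illustration of the if/then overlay idea, here is a minimal sketch in Python with NumPy. It is not ArcGIS raster calculator syntax; the three small grids and their integer category codes are made up purely to show the logic of flagging cells where one specific combination of soil, precipitation, and temperature classes occurs.

```python
import numpy as np

# Hypothetical 3x3 category rasters; the integer codes are invented for illustration.
soil   = np.array([[1, 1, 2], [2, 3, 3], [1, 2, 3]])
precip = np.array([[5, 5, 5], [6, 6, 5], [5, 6, 6]])
temp   = np.array([[0, 0, 1], [0, 1, 1], [0, 0, 1]])

# "If soil is class 2 AND precipitation is class 5 AND temperature is class 1,
# then 1, else 0", evaluated cell by cell.
match = (soil == 2) & (precip == 5) & (temp == 1)
suitable = np.where(match, 1, 0)

print(suitable)
```

The result is a 0/1 grid of the same shape as the inputs, which plays the same role as the hand-drawn polygon layer described above.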

Last edited by genuinegirly; 12-07-2007 at 03:29 PM..
Old 12-07-2007, 08:47 PM   #8 (permalink)
All important elusive independent swing voter...
 
jorgelito's Avatar
 
Location: People's Republic of KKKalifornia
Quote:
Originally Posted by Crompsin
Man, I bet WillRavel knows something about this stuff.
Why would you assume that? His training is in psychology, not statistics.
Old 12-08-2007, 07:19 AM   #9 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
Quote:
Originally Posted by jorgelito
Why would you assume that? His training is in psychology, not statistics.
From what my psychology professors tell me, psychology majors learn a lot of statistics, actually. They learn even more if they want to become researchers.

Now I am confused, since you said you have three independent variables, not one as a simple y = mx + b graph implies (to me, anyhow). Are you going to make three separate linear regressions, one for each pairing of an independent variable with the dependent variable? I won't go into detail on a solution that uses matrices to produce the best-fitting lines if that is not what is happening.
Old 12-08-2007, 10:49 AM   #10 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Right, that's the problem. It's not as simple as y = mx + b, but that's what the mapping program supposedly needs. The mapping program could also handle something along the lines of y = m*x1 + n*x2 + o*x3 + ... + b.
Old 12-08-2007, 11:58 AM   #11 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
If the equation is r = ax + by + cz + d, the algorithm uses simple matrices:
(X^T * X)^-1 * (X^T * R) = [ a ; b ; c ; d ]
With:
X = [ x_1 , y_1 , z_1 , 1 ; x_2 , y_2 , z_2 , 1 ; ... ; x_n , y_n , z_n , 1 ]
and
R = [ r_1 ; r_2 ; ... ; r_n ]


Some notes:
For this to actually work, n must be equal to or greater than 4, and the points must be varied enough that X^T * X can be inverted. If you have only 4 points, you are finding an exact equation; if n is greater than 4, it is a best-fit ... hyperplane.

If you have a graphing calculator it can perform these matrix equations. If you know some basic programming you can make the computer perform these equations. You can even make Excel perform these computations. It would be easiest to have a graphing calculator at the ready... those other two methods require a lot of work, luckily the amount of work is not determined by the number of points in your data.
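
For anyone who prefers the "basic programming" route, here is a minimal sketch in Python with NumPy. The function name fit_hyperplane and the sample points are made up for illustration. It uses np.linalg.lstsq rather than forming the inverse by hand; when X has full column rank this gives the same coefficients as the matrix formula.

```python
import numpy as np

def fit_hyperplane(x, y, z, r):
    """Least-squares fit of r = a*x + b*y + c*z + d; returns [a, b, c, d]."""
    x, y, z, r = (np.asarray(v, dtype=float) for v in (x, y, z, r))
    # Design matrix X: one row [x_i, y_i, z_i, 1] per data point.
    X = np.column_stack([x, y, z, np.ones_like(x)])
    # Minimizes ||X @ coeffs - r||; equivalent to the normal equations
    # when X has full column rank.
    coeffs, *_ = np.linalg.lstsq(X, r, rcond=None)
    return coeffs

# Made-up points generated exactly from r = 2x - y + 0.5z + 3.
x = [0, 1, 0, 0, 1, 2]
y = [0, 0, 1, 0, 1, 1]
z = [0, 0, 0, 1, 1, 3]
r = [2 * xi - yi + 0.5 * zi + 3 for xi, yi, zi in zip(x, y, z)]

a, b, c, d = fit_hyperplane(x, y, z, r)
print(a, b, c, d)
```

Because the sample points lie exactly on a plane, the fit should recover the constants used to generate them, up to floating-point rounding.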
Last edited by Hain; 12-08-2007 at 12:06 PM..
Old 12-08-2007, 01:15 PM   #12 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thank you, Augi!

I have a TI-83 graphing calculator. I know nothing about matrices. I really appreciate your help.

Here's a bit about my data:

I have 8 soil types. 83 locations total where the soil was collected. Two of the soil types only occur once. The rest have anywhere from 5 to 20 occurrences.

I have 9 "average precipitation ranges". 83 locations. "Average precipitation ranges" occur anywhere from 4 to 30 times each (i.e., 4 locations have 2000cm of precipitation/year, 30 locations have 4300cm/year).

I have 2 "average temperature ranges". 83 locations. One "average temperature range" occurs in 5 locations, the other in 79 locations.

I have presence/absence data for my plant for these 83 locations. Absent in 23 locations, present in 60 locations.

I'd like to make a predictive model of where you're likely to find my plant on the entire island, based on what I can gather from temperature, soil, and precipitation data.

What would X, R, and n be?
Old 12-08-2007, 02:01 PM   #13 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
X is the matrix that includes your input data. R is the matrix of the results. n is the number of data points; the row index runs from 1 to n, which in this case is 83.

I am afraid that this method will not work, now seeing what your data are.

There are two non-linear elements in the data: soil type and plant presence/absence. The plant presence/absence can be coded as 1 for presence and 0 for absence; however, fitting a straight line to that will give poor results. If the plant presence/absence data were replaced with a plant growth scale, like the height of the plant, then this would be more manageable. The soil type itself is in no way a linear scale... my method would only work so far as to give you 8 linear models, one for each soil type. If you had numerical properties of the soil type, then that could be modeled.

Then, the model itself is not a linear one. Both temperature and precipitation have an optimum range. Plant presence falls the further away you get from that range, which is not linear behavior. A second-order polynomial would be better... This is not a big deal with matrices, but the data are not polynomial of any order; they are bell curves. I do not know how to begin to model this. Sorry.
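
For reference, the technique named in the thread title, logistic regression, is built for a yes/no outcome such as presence/absence, and categorical inputs can be dummy-coded as 0/1 columns. The following is only a sketch of that idea in Python with NumPy, not the JMP procedure and not the matrix method above. The category names, the tiny data set, and the helper names (one_hot, fit_logistic, predict_proba) are all made up, and the fit uses plain gradient descent with a small L2 penalty for simplicity.

```python
import numpy as np

def one_hot(labels):
    """Dummy-code a categorical variable, dropping the first category as baseline."""
    cats = sorted(set(labels))
    kept = cats[1:]
    cols = [[1.0 if lab == c else 0.0 for c in kept] for lab in labels]
    return np.array(cols), kept

def fit_logistic(X, y, lr=0.5, steps=5000, l2=1e-3):
    """Gradient descent on the lightly L2-penalized logistic loss."""
    X = np.column_stack([np.ones(len(X)), X])  # prepend an intercept column
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-X @ w))       # predicted probabilities
        w -= lr * (X.T @ (p - y) / len(y) + l2 * w)
    return w

def predict_proba(w, X):
    X = np.column_stack([np.ones(len(X)), X])
    return 1.0 / (1.0 + np.exp(-X @ w))

# Invented example: 12 sites, two categorical predictors, presence coded 1/0.
soil = ["clay"] * 4 + ["loam"] * 4 + ["sand"] * 4
temp = ["cool", "cool", "warm", "warm"] * 3
present = np.array([0, 0, 0, 1, 0, 1, 1, 1, 1, 0, 1, 1], dtype=float)

S, soil_cols = one_hot(soil)   # columns for loam and sand (clay is the baseline)
T, temp_cols = one_hot(temp)   # column for warm (cool is the baseline)
X = np.hstack([S, T])

w = fit_logistic(X, present)
p = predict_proba(w, X)
print(dict(zip(["intercept"] + soil_cols + temp_cols, w)))
```

The fitted weights define a linear expression of the form w0 + w1*[soil is loam] + w2*[soil is sand] + w3*[temp is warm]; passing it through 1 / (1 + exp(-value)) gives a probability of presence between 0 and 1 for each combination of categories.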
Old 12-08-2007, 02:50 PM   #14 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thank you for your efforts!

Tags
logistic, multivariable, regression

