Tilted Forum Project Discussion Community  

Go Back   Tilted Forum Project Discussion Community > The Academy > Tilted Knowledge and How-To


 
 
LinkBack Thread Tools
Old 12-06-2007, 06:27 PM   #1 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Multivariable Logistic Regression

Anyone know how to run a multivariable logistic regression in JMP?
Or some other way to analyze my data?

I have three independent variables an one dependent. All are categorical.

Doubt anyone knows this junk, but I've seen a few scientists about the boards, figure someone might have a clue.

I'll give out more info if anyone is interested in helping.
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy
genuinegirly is offline  
Old 12-07-2007, 04:37 AM   #2 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
I haven't the slightest idea of JMP, nor do I know the nature of the data, but I am pretty good at making programs that model data. Since you say regression I assume you are given a list of points, each point with four elements: Pi=xi,yi,zi,ri.

So ri ~ f(xi,yi,zi), what is f(x,y,z)?

Given f(x,y,z) isn't too complex I can make a simple algorithm that gives you the constants of your equation.
__________________
Hain is offline  
Old 12-07-2007, 12:52 PM   #3 (permalink)
The Reverend Side Boob
 
Bear Cub's Avatar
 
Location: Nofe Curolina
Not familiar with JMP, but do you have any access to MiniTab software?
Bear Cub is offline  
Old 12-07-2007, 01:30 PM   #4 (permalink)
Deja Moo
 
Elphaba's Avatar
 
Location: Olympic Peninsula, WA
Argg...I took that class in the '80's and certainly any software used is long obsolete. Thanks a bunch, gg. My post traumatic statistics disorder has kicked in again.
__________________
"You can't ignore politics, no matter how much you'd like to." Molly Ivins - 1944-2007
Elphaba is offline  
Old 12-07-2007, 01:44 PM   #5 (permalink)
I Confess a Shiver
 
Plan9's Avatar
 
Man, I bet WillRavel knows something about this stuff.
__________________
Whatever you can carry.

"You should not drink... and bake."
Plan9 is offline  
Old 12-07-2007, 03:07 PM   #6 (permalink)
 
abaya's Avatar
 
Location: Iceland
I fucking hate statistics. I learned about this in my Regression class in 2006, but that all faded away about 2 hours after I took the final exam, unfortunately.

Sapiens is the hands-down statistical king of TFP. I would PM him if you need help.
__________________
And think not you can direct the course of Love;
for Love, if it finds you worthy, directs your course.

--Khalil Gibran
abaya is offline  
Old 12-07-2007, 03:27 PM   #7 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thanks, Augi, and others for your responses here! Unfortunately I haven't even taken a statistics course. They threw me into this field class with this JMP program, and told me to analyze my data with it. The graduate student who was helping me reached the limit of her knowledge when she realized that I did not have continous data.

Sorry, chassiswelder, no minitab access. Just JMP.

I'm supposed to be able to find a simple equation in y= mx+b format to plug into the raster calculator of ArcGIS (mapping), to make a predictive model of the population of my sedge.

I've spoken with a couple of faculty members about the issue, and they've given me a couple of other work-arounds. If/then statements to plug in, if I can figure out to plug them into the GIS program. Unfortunately now I haven't gotten an e-mail response from the program's GIS expert to tell me if that's possible. I'm thinking that I'll resort to making my layers transparent, overlaying them, and making a new polygon shapefile layer highlighting every occurance of the specific combination of abiotic factors that I'm dealing with.
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy

Last edited by genuinegirly; 12-07-2007 at 03:29 PM..
genuinegirly is offline  
Old 12-07-2007, 08:47 PM   #8 (permalink)
All important elusive independent swing voter...
 
jorgelito's Avatar
 
Location: People's Republic of KKKalifornia
Quote:
Originally Posted by Crompsin
Man, I bet WillRavel knows something about this stuff.
Why would you assume that? His training is in psychology, not statistics.
jorgelito is offline  
Old 12-08-2007, 07:19 AM   #9 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
Quote:
Originally Posted by jorgelito
Why would you assume that? His training is in psychology, not statistics.
From what my psychology professors tell me, psychology majors learn a lot of statistics, actually. They learn even more if they want to become researchers.

Now I am confused since you said you have three independent variables, not one like a simple y=mx+b graph implies, to me anyhow. Are you going to make 3 linear regressions of each combination of 1 independent and the 1 dependent? I won't go into detail on a solution that uses matrices to produce the best fitting lines if that is not what is happening.
__________________
Hain is offline  
Old 12-08-2007, 10:49 AM   #10 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Right, that's the problem. it's not as simple as y=mx+b. But that's what the mapping program supposedly needs. The mapping program could also handle something along the lines of y= mx + nx+ ox...+b.
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy
genuinegirly is offline  
Old 12-08-2007, 11:58 AM   #11 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
If the equation is r=ax+by+cz+d the algorithm uses simple matrices:
(X<sup>T</sup> * X) * (X<sup>T</sup> * R) = [ a ; b ; c ; d ]
With:
X = [ x<sub>1</sub> , y<sub>1</sub> , z<sub>1</sub> , 1 ; x<sub>2</sub> , y<sub>2</sub> , z<sub>2</sub> , 1 ; ... ; x<sub>n</sub> , y<sub>n</sub> , z<sub>n</sub> , 1 ]
and
R = [ r<sub>1</sub> ; r<sub>2</sub> ; ... r<sub>n</sub> ]


Some notes:
For this to actually work, n must be equal to or greater than 4. If you have only 4 points then you are finding an exact equation, if n is greater than 4 then it is a best fit ... hyperplane.

If you have a graphing calculator it can perform these matrix equations. If you know some basic programming you can make the computer perform these equations. You can even make Excel perform these computations. It would be easiest to have a graphing calculator at the ready... those other two methods require a lot of work, luckily the amount of work is not determined by the number of points in your data.
__________________

Last edited by Hain; 12-08-2007 at 12:06 PM..
Hain is offline  
Old 12-08-2007, 01:15 PM   #12 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thank you, Augi!

I have a TI-83 graphing calculator. I know nothing about matrices. I really appreciate your help.

Here's a bit about my data:

I have 8 soil types. 83 locations total where the soil was collected. Two of the soil types only occur once. The rest have anywhere from 5 to 20 occurances.

I have 9 "average percipitation ranges". 83 locations. "average percipitation ranges" occur anywhere from 4 to 30 times each. (ie- 4 locations have 2000cm of precipitation/year, 30 locations have 4300cm/year).

I have 2 "average temperature ranges". 83 locations. One "average temperature range" occurs in 5 locations, another 79 locations.

I have presence/absence data for my plant for these 83 locations. Absent in 23 locations, present in 60 locations.

I'd like to make a predictive model of where you're likely to find my plant on the entire island, based on what I can gather from temperature, soil, and precipitation data.

What would X, R, and n be?
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy
genuinegirly is offline  
Old 12-08-2007, 02:01 PM   #13 (permalink)
has a plan
 
Hain's Avatar
 
Location: middle of Whywouldanyonebethere
X is the matrix that includes your input data. R is the matrix of the results. n is just element index---in this case n goes from 1 to 83.

I am afraid that this method will not work, now seeing what your data are.

There are two non-linear elements in the data: soil type and plant presence/absence. The plant presence/absence can be generated as 1 for presence and 0 for absence, however that will give poor results. If the plant presence/absence data were replaced with a plant growth scale, like the height of the plant, then this would be more manageable. The soil type itself is no way a linear scale... my method would only work so far as to give you 8 linear models, one for each soil type. If you had numerical properties of the soil type then that could be modeled.

Then, the model itself is not a linear one. Both temperature and precipitation have an optimum range. Plant presence falls the further away you get from that region---not linear behavior. A second order polynomial would be better... This is not a big deal with matrices, but the data are not polynomial of any order---they are bell curves. I do not know how to begin to model this. Sorry.
__________________
Hain is offline  
Old 12-08-2007, 02:50 PM   #14 (permalink)
Eat your vegetables
 
genuinegirly's Avatar
 
Super Moderator
Location: Arabidopsis-ville
Thank you for your efforts!
__________________
"Sometimes I have to remember that things are brought to me for a reason, either for my own lessons or for the benefit of others." Cynthetiq

"violence is no more or less real than non-violence." roachboy
genuinegirly is offline  
 

Tags
logistic, multivariable, regression


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -8. The time now is 02:30 PM.

Tilted Forum Project

Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2024, vBulletin Solutions, Inc.
Search Engine Optimization by vBSEO 3.6.0 PL2
© 2002-2012 Tilted Forum Project

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360