Tilted Forum Project Discussion Community

Tilted Forum Project Discussion Community (https://thetfp.com/tfp/)
-   Tilted Knowledge and How-To (https://thetfp.com/tfp/tilted-knowledge-how/)
-   -   Multivariable Logistic Regression (https://thetfp.com/tfp/tilted-knowledge-how/128450-multivariable-logistic-regression.html)

genuinegirly 12-06-2007 06:27 PM

Multivariable Logistic Regression
 
Anyone know how to run a multivariable logistic regression in JMP?
Or some other way to analyze my data?

I have three independent variables an one dependent. All are categorical.

Doubt anyone knows this junk, but I've seen a few scientists about the boards, figure someone might have a clue.

I'll give out more info if anyone is interested in helping.

Hain 12-07-2007 04:37 AM

I haven't the slightest idea of JMP, nor do I know the nature of the data, but I am pretty good at making programs that model data. Since you say regression I assume you are given a list of points, each point with four elements: Pi=xi,yi,zi,ri.

So ri ~ f(xi,yi,zi), what is f(x,y,z)?

Given f(x,y,z) isn't too complex I can make a simple algorithm that gives you the constants of your equation.

Bear Cub 12-07-2007 12:52 PM

Not familiar with JMP, but do you have any access to MiniTab software?

Elphaba 12-07-2007 01:30 PM

Argg...I took that class in the '80's and certainly any software used is long obsolete. Thanks a bunch, gg. My post traumatic statistics disorder has kicked in again. :)

Plan9 12-07-2007 01:44 PM

Man, I bet WillRavel knows something about this stuff.

abaya 12-07-2007 03:07 PM

I fucking hate statistics. I learned about this in my Regression class in 2006, but that all faded away about 2 hours after I took the final exam, unfortunately.

Sapiens is the hands-down statistical king of TFP. I would PM him if you need help.

genuinegirly 12-07-2007 03:27 PM

Thanks, Augi, and others for your responses here! Unfortunately I haven't even taken a statistics course. They threw me into this field class with this JMP program, and told me to analyze my data with it. The graduate student who was helping me reached the limit of her knowledge when she realized that I did not have continous data.

Sorry, chassiswelder, no minitab access. Just JMP.

I'm supposed to be able to find a simple equation in y= mx+b format to plug into the raster calculator of ArcGIS (mapping), to make a predictive model of the population of my sedge.

I've spoken with a couple of faculty members about the issue, and they've given me a couple of other work-arounds. If/then statements to plug in, if I can figure out to plug them into the GIS program. Unfortunately now I haven't gotten an e-mail response from the program's GIS expert to tell me if that's possible. I'm thinking that I'll resort to making my layers transparent, overlaying them, and making a new polygon shapefile layer highlighting every occurance of the specific combination of abiotic factors that I'm dealing with.

jorgelito 12-07-2007 08:47 PM

Quote:

Originally Posted by Crompsin
Man, I bet WillRavel knows something about this stuff.

Why would you assume that? His training is in psychology, not statistics.

Hain 12-08-2007 07:19 AM

Quote:

Originally Posted by jorgelito
Why would you assume that? His training is in psychology, not statistics.

From what my psychology professors tell me, psychology majors learn a lot of statistics, actually. They learn even more if they want to become researchers.

Now I am confused since you said you have three independent variables, not one like a simple y=mx+b graph implies, to me anyhow. Are you going to make 3 linear regressions of each combination of 1 independent and the 1 dependent? I won't go into detail on a solution that uses matrices to produce the best fitting lines if that is not what is happening.

genuinegirly 12-08-2007 10:49 AM

Right, that's the problem. it's not as simple as y=mx+b. But that's what the mapping program supposedly needs. The mapping program could also handle something along the lines of y= mx + nx+ ox...+b.

Hain 12-08-2007 11:58 AM

If the equation is r=ax+by+cz+d the algorithm uses simple matrices:
(X<sup>T</sup> * X) * (X<sup>T</sup> * R) = [ a ; b ; c ; d ]
With:
X = [ x<sub>1</sub> , y<sub>1</sub> , z<sub>1</sub> , 1 ; x<sub>2</sub> , y<sub>2</sub> , z<sub>2</sub> , 1 ; ... ; x<sub>n</sub> , y<sub>n</sub> , z<sub>n</sub> , 1 ]
and
R = [ r<sub>1</sub> ; r<sub>2</sub> ; ... r<sub>n</sub> ]


Some notes:
For this to actually work, n must be equal to or greater than 4. If you have only 4 points then you are finding an exact equation, if n is greater than 4 then it is a best fit ... hyperplane.

If you have a graphing calculator it can perform these matrix equations. If you know some basic programming you can make the computer perform these equations. You can even make Excel perform these computations. It would be easiest to have a graphing calculator at the ready... those other two methods require a lot of work, luckily the amount of work is not determined by the number of points in your data.

genuinegirly 12-08-2007 01:15 PM

Thank you, Augi!

I have a TI-83 graphing calculator. I know nothing about matrices. I really appreciate your help.

Here's a bit about my data:

I have 8 soil types. 83 locations total where the soil was collected. Two of the soil types only occur once. The rest have anywhere from 5 to 20 occurances.

I have 9 "average percipitation ranges". 83 locations. "average percipitation ranges" occur anywhere from 4 to 30 times each. (ie- 4 locations have 2000cm of precipitation/year, 30 locations have 4300cm/year).

I have 2 "average temperature ranges". 83 locations. One "average temperature range" occurs in 5 locations, another 79 locations.

I have presence/absence data for my plant for these 83 locations. Absent in 23 locations, present in 60 locations.

I'd like to make a predictive model of where you're likely to find my plant on the entire island, based on what I can gather from temperature, soil, and precipitation data.

What would X, R, and n be?

Hain 12-08-2007 02:01 PM

X is the matrix that includes your input data. R is the matrix of the results. n is just element index---in this case n goes from 1 to 83.

I am afraid that this method will not work, now seeing what your data are.

There are two non-linear elements in the data: soil type and plant presence/absence. The plant presence/absence can be generated as 1 for presence and 0 for absence, however that will give poor results. If the plant presence/absence data were replaced with a plant growth scale, like the height of the plant, then this would be more manageable. The soil type itself is no way a linear scale... my method would only work so far as to give you 8 linear models, one for each soil type. If you had numerical properties of the soil type then that could be modeled.

Then, the model itself is not a linear one. Both temperature and precipitation have an optimum range. Plant presence falls the further away you get from that region---not linear behavior. A second order polynomial would be better... This is not a big deal with matrices, but the data are not polynomial of any order---they are bell curves. I do not know how to begin to model this. Sorry.

genuinegirly 12-08-2007 02:50 PM

Thank you for your efforts!


All times are GMT -8. The time now is 03:13 PM.

Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
Search Engine Optimization by vBSEO 3.6.0 PL2
© 2002-2012 Tilted Forum Project


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360