<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel>
        <title>Statistical Consulting Center Forums - Multivariate Analysis</title>
        <description>Forum for the observation and analysis of more than one statistical variable at a time.</description>
        <link>http://forums.stat.ucla.edu/list.php?7</link>
        <lastBuildDate>Mon, 23 Nov 2009 22:46:00 -0800</lastBuildDate>
        <generator>Phorum 5.2.10</generator>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,151,151#msg-151</guid>
            <title>Likert data (8 replies)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,151,151#msg-151</link>
            <description><![CDATA[ 150 patients who are to be screened for a relatively rare disease have answered eleven questions about how much they agree or disagree with statements related to barriers to their diagnosis.  For example, they are asked to repond to the questions &quot;I do not have a physician for testing&quot;  or &quot;I have no insurance coverage to pay for testing&quot;.  <br />
<br />
They may choose from five Likert-style responses:<br />
<br />
'Strongly Disagree'<br />
'Disagree' <br />
'Neutral'<br />
'Strongly Agree'<br />
'Agree' <br />
<br />
A colleague suggested dichotomizing the responses - i.e. 'Strongly Disagree' and 'Disagree' combined into '0/no', and 'Strongly Agree' and 'Agree' combined into '1/yes' and then using logistic to regress the response about barriers to diagnosis on risk factors (family history, childhood health issues, and current (adult) health issues).<br />
<br />
I am here for a second opinion.<br />
<br />
I also am unsure of what to do with the 'Neutral' responses when dichotimizing.<br />
<br />
Thank you!]]></description>
            <dc:creator>bridgette</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Tue, 17 Nov 2009 10:33:50 -0800</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,110,110#msg-110</guid>
            <title>Ordinal variables in Multiple Regression (2 replies)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,110,110#msg-110</link>
            <description><![CDATA[ Hi, I would be very grateful if someone could help me.<br />
<br />
I have 2 types of dependent variables: binary (yes/no) and ordinal (5 likert categories).  And 4 types predictors: binary, continuous (age), ordinal (education) and interval (income).<br />
<br />
I would like to analyse each outcome against the whole set of predictors, preferable with SPSS.  I assume that I have to use Binary Logistic Regression for the binary outcome; and Ordinal Logistic Regression for the ordinal outcome. However, I don't know how to deal with the ordinal &amp; interval predictors in both mentioned methods.  All the books i have read only show to use the Logistic Regression with categorial predictor (using dummy variable), without considering the rank/order of the categorial predictor.<br />
<br />
Many thanks in advance for your help.]]></description>
            <dc:creator>meilinda</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Sat, 03 Oct 2009 06:00:31 -0700</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,79,79#msg-79</guid>
            <title>multifactorial logistical regression? (1 reply)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,79,79#msg-79</link>
            <description><![CDATA[ I have conducted a survey study of surgeons on clinical decision making.  I have a study that has one dependent variable and four independent variables.  All variables are &quot;nominal&quot; or &quot;categorical.&quot;  Basically my dependent variable is the answer to a yes/no question: would you perform surgery?  The independent variables are location (East, Midwest, South, West), experience (early career, mid career, late career), type of surgeon (ENT surgeon, general surgeon), and surgical volume (few, little, medium, lots).<br />
     I think that the test I need to perform is a &quot;multi factorial logistic regression&quot;?  Is this correct?  And anybody know what is the easiest way to perform this test i.e. what software program (online, bought, or otherwise) is best?]]></description>
            <dc:creator>statnoob</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Mon, 17 Aug 2009 12:04:06 -0700</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,65,65#msg-65</guid>
            <title>Predictive modeling using Logistic Regression (8 replies)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,65,65#msg-65</link>
            <description><![CDATA[ I'm trying to do a predictive modeling for a research project. Trying to use a set of predictors to predict an individual's probability of being a case (patient).<br />
<br />
The Training dataset is about 2500 cases and 6500 controls. 700 variables were selected comparing case vs control group at p&lt;0.001.<br />
The independent dataset for validating the result is 500 cases and 500 controls.<br />
So I use Logistic Regression on the training dataset, and selected 350 predicting variables (out from the 700 significant variables) using stepwise selection, cut-off at p=0.05<br />
<br />
Now I'd like to apply these 350 variables on the 500 case/500 control dataset; and see how good it can do the prediction of an individual being a patient. But it is small and 350 variables will be overfitting using Logistic Regression. I still apply Logistic Regression to do stepwise selection, using p=0.05 (35 variables were selected) and p=0.10 (60 variables were selected), p=0.15 (76 variables were selected) and p=0.20 (118 variables were selected); the Hosmer &amp; Lemeshow Goodness of fit for all four selections, p&gt;0.05.<br />
<br />
I'm not sure if this is a good approach, or do you have any suggestion in terms of how to use the independent dataset to do validation from the 350 variables selected from the training dataset? Thank you very much.]]></description>
            <dc:creator>HuiW</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Wed, 22 Jul 2009 12:53:42 -0700</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,23,23#msg-23</guid>
            <title>Modeling zero inflated binary response with multi-classes categorical independent variables (7 replies)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,23,23#msg-23</link>
            <description><![CDATA[ Is this the right place to ask statistical question?]]></description>
            <dc:creator>questionboy</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Thu, 02 Apr 2009 22:17:54 -0700</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,22,22#msg-22</guid>
            <title>third post (no replies)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,22,22#msg-22</link>
            <description><![CDATA[ Nice work done!]]></description>
            <dc:creator>questionboy</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Thu, 02 Apr 2009 21:27:44 -0700</pubDate>
        </item>
        <item>
            <guid>http://forums.stat.ucla.edu/read.php?7,4,4#msg-4</guid>
            <title>First post (1 reply)</title>
            <link>http://forums.stat.ucla.edu/read.php?7,4,4#msg-4</link>
            <description><![CDATA[ Welcome to SCC's Mulivariate Analysis forum.  If you are subscribed to the feed or are a moderator you should receive notice of this posting.]]></description>
            <dc:creator>Jose Hales-Garcia</dc:creator>
            <category>Multivariate Analysis</category>
            <pubDate>Mon, 02 Mar 2009 12:06:54 -0800</pubDate>
        </item>
    </channel>
</rss>
