Comments on Data Mining in MATLAB: Stratified Sampling
Blog author: Will Dwinnell (http://www.blogger.com/profile/03379859054257561952)
Feed updated: 2023-11-03T08:31:23.698-04:00
7 comments

Anonymous, 2015-01-27T11:24:30.106-05:00

Hello,

Thank you for your very helpful blog; however, I am still a little lost as to how I can achieve stratified 10-fold cross-validation. Please forgive me, I am a novice at MATLAB.

I have read this post along with the one on how to divide data randomly into equal-sized groups, and I can't quite figure out how to merge both tips.
http://matlabdatamining.blogspot.co.uk/2007/02/dividing-data-randomly-into-equal-sized.html

Would you point me in the right direction, please?

Many thanks!
K

cguitar (https://www.blogger.com/profile/01032181616134167318), 2013-02-19T02:46:48.379-05:00

Thanks for this nice post! However, I have a question:
What if there is a 7th column in SimpleData, for example, 30% of the 100000 people are young (<18 years old) and 70% of them are older than 18?
Now we have two things to consider: gender and age.
How do we do stratified random sampling in this case?

Anonymous, 2009-04-21T22:34:00.000-04:00

Stratified sampling is a special case of the jackknife resampling technique, and the most usual stratified sampling is cross-validation (CV).

Anonymous, 2008-09-15T12:23:00.000-04:00

Hi! Nice page! I was wondering if somebody could help me with some advice. I have a data set of users and their movie ratings, and for each user I need to randomly assign 50% to training and 50% to testing. How can I do that in MATLAB?

Will Dwinnell (https://www.blogger.com/profile/03379859054257561952), 2007-11-15T21:11:00.000-05:00

Data may be split into "train" and "test" groups via simple random sampling ("SRS") or via stratified sampling.
In the case of stratified sampling, there is usually some variable which the analyst desires to remain similarly distributed among the "train" and "test" groups, such as the outcome class.

Anonymous, 2007-11-15T10:44:00.000-05:00

Thank you for your nice and easy-to-follow explanation.

However, I have one small question: Do researchers use stratified sampling to split training data from testing data in a given data set?

Thanks again,
Mana

Anonymous, 2007-11-15T10:42:00.000-05:00

Thank you for this nice guide.

Though I have a question: do we use stratified sampling to split the training set from the testing set?
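
Several questions in this thread (stratified 10-fold cross-validation, stratifying on two variables at once, a 50/50 train/test split) come down to the same step: shuffle the observations within each stratum, then deal them out to the groups in turn. The following is only a minimal sketch of that idea, written in Python rather than MATLAB and not taken from the original post. The function name `stratified_folds`, its parameters, and the example data are all made up for illustration. In MATLAB, the shuffle could be done with `randperm`.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k, seed=0):
    """Return a fold number (0..k-1) for every observation.

    Hypothetical helper, not from the original post. Observations that
    share a label form one stratum; each stratum is shuffled and then
    dealt out to the folds in turn, so every fold receives a near-equal
    share of every stratum.
    """
    rng = random.Random(seed)

    # Collect the row indices belonging to each stratum.
    by_stratum = defaultdict(list)
    for i, label in enumerate(labels):
        by_stratum[label].append(i)

    folds = [None] * len(labels)
    offset = 0  # continue dealing where the previous stratum stopped
    for members in by_stratum.values():
        rng.shuffle(members)  # random order within the stratum
        for j, i in enumerate(members):
            folds[i] = (offset + j) % k
        offset += len(members)
    return folds


# Made-up example data: stratify on gender AND age group together by
# using the pair (gender, age group) as the stratum label.
gender = ["F"] * 60 + ["M"] * 40
age = ["young"] * 18 + ["adult"] * 42 + ["young"] * 12 + ["adult"] * 28
strata = list(zip(gender, age))

folds = stratified_folds(strata, k=10)

# Fold 0 is held out for testing; the other nine folds are for training.
test_rows = [i for i, f in enumerate(folds) if f == 0]
train_rows = [i for i, f in enumerate(folds) if f != 0]
```

Under the same assumptions, `k=2` gives a stratified 50/50 train/test split, and using the user ID as the label would split each user's ratings roughly in half. Carrying `offset` from one stratum to the next is a design choice that keeps the overall fold sizes within one observation of each other.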