by Skye
Last Updated September 14, 2018 15:19 PM

This is somewhat both a programming question, and a stats question. So sorry for the overlap (although it seems that there is a large overlap between the realm of stats and programming of professionals). I have a dataset with ~1000 cases matched to ~100,000 controls (each case matched to multiple controls). However, the matches are not mutually exclusive. That is, some of the controls that are matched to one case, may also be matched to another case. However, I do not consider it many-to-many matching because I can't match the cases to each other (an explanation of the reason for this would make this post unbearably long, so I'll try to spare it if possible)

What I would like to do is compute a mean difference (essentially a grand mean difference), with confidence intervals. Lets assume the variable of interest is normally distributed. How can I construct these confidence intervals in R and properly account for this matching scheme? I'm open to bootstrapping if necessary, but would like to avoid it if possible (mainly in the interest of time).

- Serverfault Help
- Superuser Help
- Ubuntu Help
- Webapps Help
- Webmasters Help
- Programmers Help
- Dba Help
- Drupal Help
- Wordpress Help
- Magento Help
- Joomla Help
- Android Help
- Apple Help
- Game Help
- Gaming Help
- Blender Help
- Ux Help
- Cooking Help
- Photo Help
- Stats Help
- Math Help
- Diy Help
- Gis Help
- Tex Help
- Meta Help
- Electronics Help
- Stackoverflow Help
- Bitcoin Help
- Ethereum Help