71 results on '"Holmes CC"'
Search Results
2. Quality control and conduct of genome-wide association meta-analyses
- Author
-
Winkler, T, Day, F, Croteau Chonka, D, Wood, A, Locke, A, Mägi, R, Ferreira, T, Fall, T, Graff, M, Justice, A, Luan, J, Gustafsson, S, Randall, J, Vedantam, S, Workalemahu, T, Kilpeläinen, T, Scherag, A, Esko, T, Kutalik, Z, Heid, I, Loos, R, Abecasis GR, Absher D, Alavere H, Albrecht E, Allen HL, Almgren P, Amin N, Amouyel P, Anderson D, Arnold AM, Arveiler D, Aspelund T, Asselbergs FW, Assimes TL, Atalay M, Attwood AP, Atwood LD, Bakker SJ, Balkau B, Balmforth AJ, Barlassina C, Barroso I, Basart H, Bauer S, Beckmann JS, Beilby JP, Bennett AJ, Ben Shlomo Y, Bergman RN, Bergmann S, Berndt SI, Biffar R, Di Blasio AM, Boehm BO, Boehnke M, Boeing H, Boerwinkle E, Bolton JL, Bonnefond A, Bonnycastle LL, Boomsma DI, Borecki IB, Bornstein SR, Bouatia Naji N, Boucher G, Bragg Gresham JL, BRAMBILLA, PAOLO, Bruinenberg M, Buchanan TA, Buechler C, Cadby G, Campbell H, Caulfield MJ, Cavalcanti Proença C, CESANA, GIANCARLO, Chanock SJ, Chasman DI, Chen YD, Chines PS, Clegg DJ, Coin L, Collins FS, Connell JM, Cookson W, Cooper MN, Croteau Chonka DC, Cupples LA, Cusi D, Day FR, Day IN, Dedoussis GV, Dei M, Deloukas P, Dermitzakis ET, Dimas AS, Dimitriou M, Dixon AL, Dörr M, van Duijn CM, Ebrahim S, Edkins S, Eiriksdottir G, Eisinger K, Eklund N, Elliott P, Erbel R, Erdmann J, Erdos MR, Eriksson JG, Esko T, Estrada K, Evans DM, de Faire U, Fall T, Farrall M, Feitosa MF, Ferrario MM, Ferreira T, Ferrières J, Fischer K, Fisher E, Fowkes G, Fox CS, Franke L, Franks PW, Fraser RM, Frau F, Frayling T, Freimer NB, Froguel P, Fu M, Gaget S, Ganna A, Gejman PV, Gentilini D, Geus EJ, Gieger C, Gigante B, Gjesing AP, Glazer NL, Goddard ME, Goel A, Grallert H, Gräßler J, Grönberg H, Groop LC, Groves CJ, Gudnason V, Guiducci C, Gustafsson S, Gyllensten U, Hall AS, Hall P, Hallmans G, Hamsten A, Hansen T, Haritunians T, Harris TB, van der Harst P, Hartikainen AL, Hassanali N, Hattersley AT, Havulinna AS, Hayward C, Heard Costa NL, Heath AC, Hebebrand J, Heid IM, den Heijer M, Hengstenberg C, Herzig KH, Hicks AA, Hingorani A, Hinney A, Hirschhorn JN, Hofman A, Holmes CC, Homuth G, Hottenga JJ, Hovingh KG, Hu FB, Hu YJ, Huffman JE, Hui J, Huikuri H, Humphries SE, Hung J, Hunt SE, Hunter D, Hveem K, Hyppönen E, Igl W, Illig T, Ingelsson E, Iribarren C, Isomaa B, Jackson AU, Jacobs KB, James AL, Jansson JO, Jarick I, Jarvelin MR, Jöckel KH, Johansson Å, Johnson T, Jolley J, Jørgensen T, Jousilahti P, Jula A, Justice AE, Kaakinen M, Kähönen M, Kajantie E, Kanoni S, Kao WH, Kaplan LM, Kaplan RC, Kaprio J, Kapur K, Karpe F, Kathiresan S, Kee F, Keinanen Kiukaanniemi SM, Ketkar S, Kettunen J, Khaw KT, Kiemeney LA, Kilpeläinen TO, Kinnunen L, Kivimaki M, Kivmaki M, Van der Klauw MM, Kleber ME, Knowles JW, Koenig W, Kolcic I, Kolovou G, König IR, Koskinen S, Kovacs P, Kraft P, Kraja AT, Kristiansson K, KrjutÅjkov K, Kroemer HK, Krohn JP, Krzelj V, Kuh D, Kulzer JR, Kumari M, Kutalik Z, Kuulasmaa K, Kuusisto J, Kvaloy K, Laakso M, Laitinen JH, Lakka TA, Lamina C, Langenberg C, Lantieri O, Lathrop GM, Launer LJ, Lawlor DA, Lawrence RW, Leach IM, Lecoeur C, Lee SH, Lehtimäki T, Leitzmann MF, Lettre G, Levinson DF, Li G, Li S, Liang L, Lin DY, Lind L, Lindgren CM, Lindström J, Liu J, Liuzzi A, Locke AE, Lokki ML, Loley C, Loos RJ, Lorentzon M, Luan J, Luben RN, Ludwig B, Madden PA, Mägi R, Magnusson PK, Mangino M, Manunta P, Marek D, Marre M, Martin NG, März W, Maschio A, Mathieson I, McArdle WL, McCaroll SA, McCarthy A, McCarthy MI, McKnight B, Medina Gomez C, Medland SE, Meitinger T, Metspalu A, van Meurs JB, Meyre D, Midthjell K, Mihailov E, Milani L, Min JL, Moebus S, Moffatt MF, Mohlke KL, Molony C, Monda KL, Montgomery GW, Mooser V, Morken MA, Morris AD, Morris AP, Mühleisen TW, Müller Nurasyid M, Munroe PB, Musk AW, Narisu N, Navis G, Neale BM, Nelis M, Nemesh J, Neville MJ, Ngwa JS, Nicholson G, Nieminen MS, Njølstad I, Nohr EA, Nolte IM, North KE, Nöthen MM, Nyholt DR, O'Connell JR, Ohlsson C, Oldehinkel AJ, van Ommen GJ, Ong KK, Oostra BA, Ouwehand WH, Palmer CN, Palmer LJ, Palotie A, Paré G, Parker AN, Paternoster L, Pawitan Y, Pechlivanis S, Peden JF, Pedersen NL, Pedersen O, Pellikka N, Peltonen L, Penninx B, Perola M, Perry JR, Person T, Peters A, Peters MJ, Pichler I, Pietiläinen KH, Platou CG, Polasek O, Pouta A, Power C, Pramstaller PP, Preuss M, Price JF, Prokopenko I, Province MA, Psaty BM, Purcell S, Pütter C, Qi L, Quertermous T, Radhakrishnan A, Raitakari O, Randall JC, Rauramaa R, Rayner NW, Rehnberg E, Rendon A, Ridderstråle M, Ridker PM, Ripatti S, Rissanen A, Rivadeneira F, Rivolta C, Robertson NR, Rose LM, Rudan I, Saaristo TE, Sager H, Salomaa V, Samani NJ, Sambrook JG, Sanders AR, Sandholt C, Sanna S, Saramies J, Schadt EE, Scherag A, Schipf S, Schlessinger D, Schreiber S, Schunkert H, Schwarz PE, Scott LJ, Shi J, Shin SY, Shuldiner AR, Shungin D, Signorini S, Silander K, Sinisalo J, Skrobek B, Smit JH, Smith AV, Smith GD, Snieder H, Soranzo N, Sørensen TI, Sovio U, Spector TD, Speliotes EK, Stančáková A, Stark K, Stefansson K, Steinthorsdottir V, Stephens JC, Stirrups K, Stolk RP, Strachan DP, Strawbridge RJ, Stringham HM, Stumvoll M, Surakka I, Swift AJ, Syvanen AC, Tammesoo ML, Teder Laving M, Teslovich TM, Teumer A, Theodoraki EV, Thomson B, Thorand B, Thorleifsson G, Thorsteinsdottir U, Timpson NJ, Tönjes A, Tregouet DA, Tremoli E, Trip MD, Tuomi T, Tuomilehto J, Tyrer J, Uda M, Uitterlinden AG, Usala G, Uusitupa M, Valle TT, Vandenput L, Vatin V, Vedantam S, de Vegt F, Vermeulen SH, Viikari J, Virtamo J, Visscher PM, Vitart V, Van Vliet Ostaptchouk JV, Voight BF, Vollenweider P, Volpato CB, Völzke H, Waeber G, Waite LL, Wallaschofski H, Walters GB, Wang Z, Wareham NJ, Watanabe RM, Watkins H, Weedon MN, Welch R, Weyant RJ, Wheeler E, White CC, Wichmann HE, Widen E, Wild SH, Willemsen G, Willer CJ, Wilsgaard T, Wilson JF, van Wingerden S, Winkelmann BR, Winkler TW, Witte DR, Witteman JC, Wolffenbuttel BH, Wong A, Wood AR, Workalemahu T, Wright AF, Yang J, Yarnell JW, Zgaga L, Zhao JH, Zillikens MC, Zitting P, Zondervan KT, Life Course Epidemiology (LCE), Lifestyle Medicine (LM), Center for Liver, Digestive and Metabolic Diseases (CLDM), Winkler, T, Day, F, Croteau Chonka, D, Wood, A, Locke, A, Mägi, R, Ferreira, T, Fall, T, Graff, M, Justice, A, Luan, J, Gustafsson, S, Randall, J, Vedantam, S, Workalemahu, T, Kilpeläinen, T, Scherag, A, Esko, T, Kutalik, Z, Heid, I, Loos, R, Abecasis, G, Absher, D, Alavere, H, Albrecht, E, Allen, H, Almgren, P, Amin, N, Amouyel, P, Anderson, D, Arnold, A, Arveiler, D, Aspelund, T, Asselbergs, F, Assimes, T, Atalay, M, Attwood, A, Atwood, L, Bakker, S, Balkau, B, Balmforth, A, Barlassina, C, Barroso, I, Basart, H, Bauer, S, Beckmann, J, Beilby, J, Bennett, A, Ben Shlomo, Y, Bergman, R, Bergmann, S, Berndt, S, Biffar, R, Di Blasio, A, Boehm, B, Boehnke, M, Boeing, H, Boerwinkle, E, Bolton, J, Bonnefond, A, Bonnycastle, L, Boomsma, D, Borecki, I, Bornstein, S, Bouatia Naji, N, Boucher, G, Bragg Gresham, J, Brambilla, P, Bruinenberg, M, Buchanan, T, Buechler, C, Cadby, G, Campbell, H, Caulfield, M, Cavalcanti Proença, C, Cesana, G, Chanock, S, Chasman, D, Chen, Y, Chines, P, Clegg, D, Coin, L, Collins, F, Connell, J, Cookson, W, Cooper, M, Cupples, L, Cusi, D, Day, I, Dedoussis, G, Dei, M, Deloukas, P, Dermitzakis, E, Dimas, A, Dimitriou, M, Dixon, A, Dörr, M, van Duijn, C, Ebrahim, S, Edkins, S, Eiriksdottir, G, Eisinger, K, Eklund, N, Elliott, P, Erbel, R, Erdmann, J, Erdos, M, Eriksson, J, Estrada, K, Evans, D, de Faire, U, Farrall, M, Feitosa, M, Ferrario, M, Ferrières, J, Fischer, K, Fisher, E, Fowkes, G, Fox, C, Franke, L, Franks, P, Fraser, R, Frau, F, Frayling, T, Freimer, N, Froguel, P, Fu, M, Gaget, S, Ganna, A, Gejman, P, Gentilini, D, Geus, E, Gieger, C, Gigante, B, Gjesing, A, Glazer, N, Goddard, M, Goel, A, Grallert, H, Gräßler, J, Grönberg, H, Groop, L, Groves, C, Gudnason, V, Guiducci, C, Gyllensten, U, Hall, A, Hall, P, Hallmans, G, Hamsten, A, Hansen, T, Haritunians, T, Harris, T, van der Harst, P, Hartikainen, A, Hassanali, N, Hattersley, A, Havulinna, A, Hayward, C, Heard Costa, N, Heath, A, Hebebrand, J, den Heijer, M, Hengstenberg, C, Herzig, K, Hicks, A, Hingorani, A, Hinney, A, Hirschhorn, J, Hofman, A, Holmes, C, Homuth, G, Hottenga, J, Hovingh, K, Hu, F, Hu, Y, Huffman, J, Hui, J, Huikuri, H, Humphries, S, Hung, J, Hunt, S, Hunter, D, Hveem, K, Hyppönen, E, Igl, W, Illig, T, Ingelsson, E, Iribarren, C, Isomaa, B, Jackson, A, Jacobs, K, James, A, Jansson, J, Jarick, I, Jarvelin, M, Jöckel, K, Johansson, Å, Johnson, T, Jolley, J, Jørgensen, T, Jousilahti, P, Jula, A, Kaakinen, M, Kähönen, M, Kajantie, E, Kanoni, S, Kao, W, Kaplan, L, Kaplan, R, Kaprio, J, Kapur, K, Karpe, F, Kathiresan, S, Kee, F, Keinanen Kiukaanniemi, S, Ketkar, S, Kettunen, J, Khaw, K, Kiemeney, L, Kinnunen, L, Kivimaki, M, Kivmaki, M, Van der Klauw, M, Kleber, M, Knowles, J, Koenig, W, Kolcic, I, Kolovou, G, König, I, Koskinen, S, Kovacs, P, Kraft, P, Kraja, A, Kristiansson, K, Krjutåjkov, K, Kroemer, H, Krohn, J, Krzelj, V, Kuh, D, Kulzer, J, Kumari, M, Kuulasmaa, K, Kuusisto, J, Kvaloy, K, Laakso, M, Laitinen, J, Lakka, T, Lamina, C, Langenberg, C, Lantieri, O, Lathrop, G, Launer, L, Lawlor, D, Lawrence, R, Leach, I, Lecoeur, C, Lee, S, Lehtimäki, T, Leitzmann, M, Lettre, G, Levinson, D, Li, G, Li, S, Liang, L, Lin, D, Lind, L, Lindgren, C, Lindström, J, Liu, J, Liuzzi, A, Lokki, M, Loley, C, Lorentzon, M, Luben, R, Ludwig, B, Madden, P, Magnusson, P, Mangino, M, Manunta, P, Marek, D, Marre, M, Martin, N, März, W, Maschio, A, Mathieson, I, Mcardle, W, Mccaroll, S, Mccarthy, A, Mccarthy, M, Mcknight, B, Medina Gomez, C, Medland, S, Meitinger, T, Metspalu, A, van Meurs, J, Meyre, D, Midthjell, K, Mihailov, E, Milani, L, Min, J, Moebus, S, Moffatt, M, Mohlke, K, Molony, C, Monda, K, Montgomery, G, Mooser, V, Morken, M, Morris, A, Mühleisen, T, Müller Nurasyid, M, Munroe, P, Musk, A, Narisu, N, Navis, G, Neale, B, Nelis, M, Nemesh, J, Neville, M, Ngwa, J, Nicholson, G, Nieminen, M, Njølstad, I, Nohr, E, Nolte, I, North, K, Nöthen, M, Nyholt, D, O'Connell, J, Ohlsson, C, Oldehinkel, A, van Ommen, G, Ong, K, Oostra, B, Ouwehand, W, Palmer, C, Palmer, L, Palotie, A, Paré, G, Parker, A, Paternoster, L, Pawitan, Y, Pechlivanis, S, Peden, J, Pedersen, N, Pedersen, O, Pellikka, N, Peltonen, L, Penninx, B, Perola, M, Perry, J, Person, T, Peters, A, Peters, M, Pichler, I, Pietiläinen, K, Platou, C, Polasek, O, Pouta, A, Power, C, Pramstaller, P, Preuss, M, Price, J, Prokopenko, I, Province, M, Psaty, B, Purcell, S, Pütter, C, Qi, L, Quertermous, T, Radhakrishnan, A, Raitakari, O, Rauramaa, R, Rayner, N, Rehnberg, E, Rendon, A, Ridderstråle, M, Ridker, P, Ripatti, S, Rissanen, A, Rivadeneira, F, Rivolta, C, Robertson, N, Rose, L, Rudan, I, Saaristo, T, Sager, H, Salomaa, V, Samani, N, Sambrook, J, Sanders, A, Sandholt, C, Sanna, S, Saramies, J, Schadt, E, Schipf, S, Schlessinger, D, Schreiber, S, Schunkert, H, Schwarz, P, Scott, L, Shi, J, Shin, S, Shuldiner, A, Shungin, D, Signorini, S, Silander, K, Sinisalo, J, Skrobek, B, Smit, J, Smith, A, Smith, G, Snieder, H, Soranzo, N, Sørensen, T, Sovio, U, Spector, T, Speliotes, E, Stančáková, A, Stark, K, Stefansson, K, Steinthorsdottir, V, Stephens, J, Stirrups, K, Stolk, R, Strachan, D, Strawbridge, R, Stringham, H, Stumvoll, M, Surakka, I, Swift, A, Syvanen, A, Tammesoo, M, Teder Laving, M, Teslovich, T, Teumer, A, Theodoraki, E, Thomson, B, Thorand, B, Thorleifsson, G, Thorsteinsdottir, U, Timpson, N, Tönjes, A, Tregouet, D, Tremoli, E, Trip, M, Tuomi, T, Tuomilehto, J, Tyrer, J, Uda, M, Uitterlinden, A, Usala, G, Uusitupa, M, Valle, T, Vandenput, L, Vatin, V, de Vegt, F, Vermeulen, S, Viikari, J, Virtamo, J, Visscher, P, Vitart, V, Van Vliet Ostaptchouk, J, Voight, B, Vollenweider, P, Volpato, C, Völzke, H, Waeber, G, Waite, L, Wallaschofski, H, Walters, G, Wang, Z, Wareham, N, Watanabe, R, Watkins, H, Weedon, M, Welch, R, Weyant, R, Wheeler, E, White, C, Wichmann, H, Widen, E, Wild, S, Willemsen, G, Willer, C, Wilsgaard, T, Wilson, J, van Wingerden, S, Winkelmann, B, Witte, D, Witteman, J, Wolffenbuttel, B, Wong, A, Wright, A, Yang, J, Yarnell, J, Zgaga, L, Zhao, J, Zillikens, M, Zitting, P, Zondervan, K, Psychiatry, EMGO - Mental health, Plastic, Reconstructive and Hand Surgery, ACS - Amsterdam Cardiovascular Sciences, Vascular Medicine, Cardiology, Biological Psychology, EMGO+ - Mental Health, Genetic Investigation of Anthropometric Traits (GIANT) Consortium, Abecasis, GR., Absher, D., Alavere, H., Albrecht, E., Allen, HL., Almgren, P., Amin, N., Amouyel, P., Anderson, D., Arnold, AM., Arveiler, D., Aspelund, T., Asselbergs, FW., Assimes, TL., Atalay, M., Attwood, AP., Atwood, LD., Bakker, SJ., Balkau, B., Balmforth, AJ., Barlassina, C., Barroso£££Inês£££ I., Basart, H., Bauer, S., Beckmann, JS., Beilby, JP., Bennett, AJ., Ben-Shlomo, Y., Bergman, RN., Bergmann, S., Berndt, SI., Biffar, R., Di Blasio AM., Boehm, BO., Boehnke, M., Boeing, H., Boerwinkle, E., Bolton, JL., Bonnefond, A., Bonnycastle, LL., Boomsma, DI., Borecki, IB., Bornstein, SR., Bouatia-Naji, N., Boucher, G., Bragg-Gresham, JL., Brambilla, P., Bruinenberg, M., Buchanan, TA., Buechler, C., Cadby, G., Campbell, H., Caulfield, MJ., Cavalcanti-Proença, C., Cesana, G., Chanock, SJ., Chasman, DI., Chen, YD., Chines, PS., Clegg, DJ., Coin, L., Collins, FS., Connell, JM., Cookson, W., Cooper, MN., Croteau-Chonka, DC., Cupples, LA., Cusi, D., Day, FR., Day, IN., Dedoussis, GV., Dei, M., Deloukas, P., Dermitzakis, ET., Dimas, AS., Dimitriou, M., Dixon, AL., Dörr, M., van Duijn CM., Ebrahim, S., Edkins, S., Eiriksdottir, G., Eisinger, K., Eklund, N., Elliott, P., Erbel, R., Erdmann, J., Erdos, MR., Eriksson, JG., Esko£££Tõnu£££ T., Estrada, K., Evans, DM., de Faire, U., Fall, T., Farrall, M., Feitosa, MF., Ferrario, MM., Ferreira, T., Ferrières, J., Fischer, K., Fisher, E., Fowkes, G., Fox, CS., Franke, L., Franks, PW., Fraser, RM., Frau, F., Frayling, T., Freimer, NB., Froguel, P., Fu, M., Gaget, S., Ganna, A., Gejman, PV., Gentilini, D., Geus, EJ., Gieger, C., Gigante, B., Gjesing, AP., Glazer, NL., Goddard, ME., Goel, A., Grallert, H., Gräßler, J., Grönberg, H., Groop, LC., Groves, CJ., Gudnason, V., Guiducci, C., Gustafsson, S., Gyllensten, U., Hall, AS., Hall, P., Hallmans, G., Hamsten, A., Hansen, T., Haritunians, T., Harris, TB., van der Harst, P., Hartikainen, AL., Hassanali, N., Hattersley, AT., Havulinna, AS., Hayward, C., Heard-Costa, NL., Heath, AC., Hebebrand, J., Heid, IM., den Heijer, M., Hengstenberg, C., Herzig, KH., Hicks, AA., Hingorani, A., Hinney, A., Hirschhorn, JN., Hofman, A., Holmes, CC., Homuth, G., Hottenga, JJ., Hovingh, KG., Hu, FB., Hu, YJ., Huffman, JE., Hui, J., Huikuri, H., Humphries, SE., Hung, J., Hunt, SE., Hunter, D., Hveem, K., Hyppönen, E., Igl, W., Illig, T., Ingelsson, E., Iribarren, C., Isomaa, B., Jackson, AU., Jacobs, KB., James, AL., Jansson, JO., Jarick, I., Jarvelin, MR., Jöckel, KH., Johansson£££Åsa£££ Å., Johnson, T., Jolley, J., Jørgensen, T., Jousilahti, P., Jula, A., Justice, AE., Kaakinen, M., Kähönen, M., Kajantie, E., Kanoni, S., Kao, WH., Kaplan, LM., Kaplan, RC., Kaprio, J., Kapur, K., Karpe, F., Kathiresan, S., Kee, F., Keinanen-Kiukaanniemi, SM., Ketkar, S., Kettunen, J., Khaw, KT., Kiemeney, LA., Kilpeläinen, TO., Kinnunen, L., Kivimaki, M., Kivmaki, M., Van der Klauw MM., Kleber, ME., Knowles, JW., Koenig, W., Kolcic, I., Kolovou, G., König, IR., Koskinen, S., Kovacs, P., Kraft, P., Kraja, AT., Kristiansson, K., KrjutÅjkov, K., Kroemer, HK., Krohn, JP., Krzelj, V., Kuh, D., Kulzer, JR., Kumari, M., Kutalik£££Zoltán£££ Z., Kuulasmaa, K., Kuusisto, J., Kvaloy, K., Laakso, M., Laitinen, JH., Lakka, TA., Lamina, C., Langenberg, C., Lantieri, O., Lathrop, GM., Launer, LJ., Lawlor, DA., Lawrence, RW., Leach, IM., Lecoeur, C., Lee, SH., Lehtimäki, T., Leitzmann, MF., Lettre, G., Levinson, DF., Li, G., Li, S., Liang, L., Lin, DY., Lind, L., Lindgren, CM., Lindström, J., Liu, J., Liuzzi, A., Locke, AE., Lokki, ML., Loley, C., Loos, RJ., Lorentzon, M., Luan£££Jian'an£££ J., Luben, RN., Ludwig, B., Madden, PA., Mägi, R., Magnusson, PK., Mangino, M., Manunta, P., Marek, D., Marre, M., Martin, NG., März, W., Maschio, A., Mathieson, I., McArdle, WL., McCaroll, SA., McCarthy, A., McCarthy, MI., McKnight, B., Medina-Gomez, C., Medland, SE., Meitinger, T., Metspalu, A., van Meurs JB., Meyre, D., Midthjell, K., Mihailov, E., Milani, L., Min, JL., Moebus, S., Moffatt, MF., Mohlke, KL., Molony, C., Monda, KL., Montgomery, GW., Mooser, V., Morken, MA., Morris, AD., Morris, AP., Mühleisen, TW., Müller-Nurasyid, M., Munroe, PB., Musk, AW., Narisu, N., Navis, G., Neale, BM., Nelis, M., Nemesh, J., Neville, MJ., Ngwa, JS., Nicholson, G., Nieminen, MS., Njølstad, I., Nohr, EA., Nolte, IM., North, KE., Nöthen, MM., Nyholt, DR., O'Connell, JR., Ohlsson, C., Oldehinkel, AJ., van Ommen GJ., Ong, KK., Oostra, BA., Ouwehand, WH., Palmer, CN., Palmer, LJ., Palotie, A., Paré, G., Parker, AN., Paternoster, L., Pawitan, Y., Pechlivanis, S., Peden, JF., Pedersen, NL., Pedersen, O., Pellikka, N., Peltonen, L., Penninx, B., Perola, M., Perry, JR., Person, T., Peters, A., Peters, MJ., Pichler, I., Pietiläinen, KH., Platou, CG., Polasek, O., Pouta, A., Power, C., Pramstaller, PP., Preuss, M., Price, JF., Prokopenko, I., Province, MA., Psaty, BM., Purcell, S., Pütter, C., Qi, L., Quertermous, T., Radhakrishnan, A., Raitakari, O., Randall, JC., Rauramaa, R., Rayner, NW., Rehnberg, E., Rendon, A., Ridderstråle, M., Ridker, PM., Ripatti, S., Rissanen, A., Rivadeneira, F., Rivolta, C., Robertson, NR., Rose, LM., Rudan, I., Saaristo, TE., Sager, H., Salomaa, V., Samani, NJ., Sambrook, JG., Sanders, AR., Sandholt, C., Sanna, S., Saramies, J., Schadt, EE., Scherag, A., Schipf, S., Schlessinger, D., Schreiber, S., Schunkert, H., Schwarz, PE., Scott, LJ., Shi, J., Shin, SY., Shuldiner, AR., Shungin, D., Signorini, S., Silander, K., Sinisalo, J., Skrobek, B., Smit, JH., Smith, AV., Smith, GD., Snieder, H., Soranzo, N., Sørensen, TI., Sovio, U., Spector, TD., Speliotes, EK., Stančáková, A., Stark, K., Stefansson, K., Steinthorsdottir, V., Stephens, JC., Stirrups, K., Stolk, RP., Strachan, DP., Strawbridge, RJ., Stringham, HM., Stumvoll, M., Surakka, I., Swift, AJ., Syvanen, AC., Tammesoo, ML., Teder-Laving, M., Teslovich, TM., Teumer, A., Theodoraki, EV., Thomson, B., Thorand, B., Thorleifsson, G., Thorsteinsdottir, U., Timpson, NJ., Tönjes, A., Tregouet, DA., Tremoli, E., Trip, MD., Tuomi, T., Tuomilehto, J., Tyrer, J., Uda, M., Uitterlinden, AG., Usala, G., Uusitupa, M., Valle, TT., Vandenput, L., Vatin, V., Vedantam, S., de Vegt, F., Vermeulen, SH., Viikari, J., Virtamo, J., Visscher, PM., Vitart, V., Van Vliet-Ostaptchouk JV., Voight, BF., Vollenweider, P., Volpato, CB., Völzke, H., Waeber, G., Waite, LL., Wallaschofski, H., Walters, GB., Wang, Z., Wareham, NJ., Watanabe, RM., Watkins, H., Weedon, MN., Welch, R., Weyant, RJ., Wheeler, E., White, CC., Wichmann, HE., Widen, E., Wild, SH., Willemsen, G., Willer, CJ., Wilsgaard, T., Wilson, JF., van Wingerden, S., Winkelmann, BR., Winkler, TW., Witte, DR., Witteman, JC., Wolffenbuttel, BH., Wong, A., Wood, AR., Workalemahu, T., Wright, AF., Yang, J., Yarnell, JW., Zgaga, L., Zhao, JH., Zillikens, MC., Zitting, P., and Zondervan, KT.
- Subjects
Quality Control ,Netherlands Twin Register (NTR) ,BIO/12 - BIOCHIMICA CLINICA E BIOLOGIA MOLECOLARE CLINICA ,media_common.quotation_subject ,quality control, GWAMAS ,Control (management) ,Medizin ,Genome-wide association study ,Biology ,Bioinformatics ,General Biochemistry, Genetics and Molecular Biology ,Article ,03 medical and health sciences ,0302 clinical medicine ,Software ,SDG 17 - Partnerships for the Goals ,Meta-Analysis as Topic ,Comparable size ,Quality (business) ,030304 developmental biology ,media_common ,Protocol (science) ,0303 health sciences ,business.industry ,Software package ,Data science ,Urological cancers Radboud Institute for Health Sciences [Radboudumc 15] ,Genome-Wide Association Study/methods ,Inflammatory diseases Radboud Institute for Health Sciences [Radboudumc 5] ,quality control ,genome-wide association meta-analyses ,business ,030217 neurology & neurosurgery ,Genome-Wide Association Study - Abstract
Item does not contain fulltext Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.
- Published
- 2014
- Full Text
- View/download PDF
3. Encrypted accelerated least squares regression
- Author
-
Esperança, PM, Aslett, L, Holmes, CC, Singh, A, and Zhu, J
- Subjects
Computer Science::Multimedia ,Computer Science::Cryptography and Security - Abstract
Information that is stored in an encrypted format is, by definition, usually not amenable to statistical analysis or machine learning methods. In this paper we present detailed analysis of coordinate and accelerated gradient descent algorithms which are capable of fitting least squares and penalised ridge regression models, using data encrypted under a fully homomorphic encryption scheme. Gradient descent is shown to dominate in terms of encrypted computational speed, and theoretical results are proven to give parameter bounds which ensure correctness of decryption. The characteristics of encrypted computation are empirically shown to favour a non-standard acceleration technique. This demonstrates the possibility of approximating conventional statistical regression methods using encrypted data without compromising privacy.
- Published
- 2017
4. Micro-ribonucleic acid expression profiling and expression quantitative trait loci analysis in human gluteal and abdominal adipose tissue
- Author
-
Rantalainen, M, Herrera, BM, Nicholson, G, Wills, QF, Bowden, R, Neville, MJ, Randall, JC, Barrett, A, Allen, M, McCarthy, MI, Zondervan, KT, Karpe, F, Holmes, CC, and Lindgren, CM
- Published
- 2016
5. Artemether-Lumefantrine and Dihydroartemisinin-Piperaquine Exert Inverse Selective Pressure on Plasmodium Falciparum Drug Sensitivity-Associated Haplotypes in Uganda
- Author
-
Taylor, AR, Flegg, JA, Holmes, CC, Guerin, PJ, Sibley, CH, Conrad, MD, Dorsey, G, Rosenthal, PJ, Taylor, AR, Flegg, JA, Holmes, CC, Guerin, PJ, Sibley, CH, Conrad, MD, Dorsey, G, and Rosenthal, PJ
- Abstract
BACKGROUND: Altered sensitivity to multiple antimalarial drugs is mediated by polymorphisms in pfmdr1, which encodes the Plasmodium falciparum multidrug resistance transporter. In Africa the N86Y and D1246Y polymorphisms have been shown to be selected by treatment, with artemether-lumefantrine (AL) and dihydroartemisinin-piperaquine (DP) selecting for wild-type and mutant alleles, respectively. However, there has been little study of pfmdr1 haplotypes, in part because haplotype analyses are complicated by multiclonal infections. METHODS: We fit a haplotype frequency estimation model, which accounts for multiclonal infections, to the polymorphic pfmdr1 N86Y, Y184F, and D1246Y alleles in samples from a longitudinal trial comparing AL and DP to treat uncomplicated P falciparum malaria in Tororo, Uganda from 2007 to 2012. We regressed estimates onto covariates of trial arm and selective drug pressure. RESULTS: Yearly trends showed increasing frequency estimates for haplotypes with wild type pfmdr1 N86 and D1246 alleles and decreasing frequency estimates for haplotypes with the mutant pfmdr1 86Y allele. Considering days since prior therapy, we saw evidence suggestive of selection by AL for haplotypes with N86 combined with 184F, D1246, or both, and against all haplotypes with 86Y, and evidence suggestive of selection by DP for 86Y only when combined with Y184 and 1246Y (haplotype YYY) and against haplotypes NFD and NYY. CONCLUSIONS: Based on our model, AL selected several haplotypes containing N86, whereas DP selection was haplotype specific, demonstrating the importance of haplotype analyses. Inverse selective pressure of AL and DP on the complementary haplotypes NFD and YYY suggests that rotating artemisinin-based antimalarial combination regimens may be the best treatment option to prevent resistance selection.
- Published
- 2017
6. Encrypted statistical machine learning: new privacy preserving methods
- Author
-
Aslett, LJM, Esperança, PM, and Holmes, CC
- Subjects
Computer Science::Cryptography and Security - Abstract
We present two new statistical machine learning methods designed to learn on fully homomorphic encrypted (FHE) data. The introduction of FHE schemes following Gentry (2009) opens up the prospect of privacy preserving statistical machine learning analysis and modelling of encrypted data without compromising security constraints. We propose tailored algorithms for applying extremely random forests, involving a new cryptographic stochastic fraction estimator, and na\"{i}ve Bayes, involving a semi-parametric model for the class decision boundary, and show how they can be used to learn and predict from encrypted data. We demonstrate that these techniques perform competitively on a variety of classification data sets and provide detailed information about the computational practicalities of these and other FHE methods.
- Published
- 2015
7. A review of homomorphic encryption and software tools for encrypted statistical machine learning
- Author
-
Aslett, LJM, Esperança, PM, and Holmes, CC
- Abstract
Recent advances in cryptography promise to enable secure statistical computation on encrypted data, whereby a limited set of operations can be carried out without the need to first decrypt. We review these homomorphic encryption schemes in a manner accessible to statisticians and machine learners, focusing on pertinent limitations inherent in the current state of the art. These limitations restrict the kind of statistics and machine learning algorithms which can be implemented and we review those which have been successfully applied in the literature. Finally, we document a high performance R package implementing a recent homomorphic scheme in a general framework.
- Published
- 2015
8. A general framework for updating belief distributions
- Author
-
Bissiri, P, Holmes, C, Walker, S, BISSIRI, PIER GIOVANNI, Holmes, CC, Walker, SG, Bissiri, P, Holmes, C, Walker, S, BISSIRI, PIER GIOVANNI, Holmes, CC, and Walker, SG
- Abstract
We propose a framework for general Bayesian inference. We argue that a valid update of a prior belief distribution to a posterior can be made for parameters which are connected to observations through a loss function rather than the traditional likelihood function, which is recovered as a special case. Modern application areas make it increasingly challenging for Bayesians to attempt to model the true data-generating mechanism. For instance, when the object of interest is low dimensional, such as a mean or median, it is cumbersome to have to achieve this via a complete model for the whole data distribution. More importantly, there are settings where the parameter of interest does not directly index a family of density functions and thus the Bayesian approach to learning about such parameters is currently regarded as problematic. Our framework uses loss functions to connect information in the data to functionals of interest. The updating of beliefs then follows from a decision theoretic approach involving cumulative loss functions. Importantly, the procedure coincides with Bayesian updating when a true likelihood is known yet provides coherent subjective inference in much more general settings. Connections to other inference frameworks are highlighted.
- Published
- 2016
9. Survival in stage II/III colorectal cancer is independently predicted by chromosomal and microsatellite instability, but not by specific driver mutations
- Author
-
Mouradov, D, Domingo, E, Gibbs, P, Jorissen, RN, Li, S, Soo, PY, Lipton, L, Desai, J, Danielsen, HE, Oukrif, D, Novelli, M, Yau, C, Holmes, CC, Jones, IT, McLaughlin, S, Molloy, P, Hawkins, NJ, Ward, R, Midgely, R, Kerr, D, Tomlinson, IP, and Sieber, OM
- Subjects
Male ,Proto-Oncogene Proteins B-raf ,F-Box-WD Repeat-Containing Protein 7 ,Class I Phosphatidylinositol 3-Kinases ,Ubiquitin-Protein Ligases ,DNA Mutational Analysis ,Loss of Heterozygosity ,Cell Cycle Proteins ,Disease-Free Survival ,GTP Phosphohydrolases ,Proto-Oncogene Proteins p21(ras) ,Phosphatidylinositol 3-Kinases ,Proto-Oncogene Proteins ,Humans ,neoplasms ,Aged ,Neoplasm Staging ,Aged, 80 and over ,Chromosome Aberrations ,F-Box Proteins ,Membrane Proteins ,virus diseases ,Middle Aged ,Prognosis ,female genital diseases and pregnancy complications ,digestive system diseases ,Survival Rate ,Mutation ,ras Proteins ,Female ,Microsatellite Instability ,Tumor Suppressor Protein p53 ,Colorectal Neoplasms - Abstract
OBJECTIVES: Microsatellite instability (MSI) is an established marker of good prognosis in colorectal cancer (CRC). Chromosomal instability (CIN) is strongly negatively associated with MSI and has been shown to be a marker of poor prognosis in a small number of studies. However, a substantial group of "double-negative" (MSI-/CIN-) CRCs exists. The prognosis of these patients is unclear. Furthermore, MSI and CIN are each associated with specific molecular changes, such as mutations in KRAS and BRAF, that have been associated with prognosis. It is not known which of MSI, CIN, and the specific gene mutations are primary predictors of survival. METHODS: We evaluated the prognostic value (disease-free survival, DFS) of CIN, MSI, mutations in KRAS, NRAS, BRAF, PIK3CA, FBXW7, and TP53, and chromosome 18q loss-of-heterozygosity (LOH) in 822 patients from the VICTOR trial of stage II/III CRC. We followed up promising associations in an Australian community-based cohort (N=375). RESULTS: In the VICTOR patients, no specific mutation was associated with DFS, but individually MSI and CIN showed significant associations after adjusting for stage, age, gender, tumor location, and therapy. A combined analysis of the VICTOR and community-based cohorts showed that MSI and CIN were independent predictors of DFS (for MSI, hazard ratio (HR)=0.58, 95% confidence interval (CI) 0.36-0.93, and P=0.021; for CIN, HR=1.54, 95% CI 1.14-2.08, and P=0.005), and joint CIN/MSI testing significantly improved the prognostic prediction of MSI alone (P=0.028). Higher levels of CIN were monotonically associated with progressively poorer DFS, and a semi-quantitative measure of CIN was a better predictor of outcome than a simple CIN+/- variable. All measures of CIN predicted DFS better than the recently described Watanabe LOH ratio. CONCLUSIONS: MSI and CIN are independent predictors of DFS for stage II/III CRC. Prognostic molecular tests for CRC relapse should currently use MSI and a quantitative measure of CIN rather than specific gene mutations.
- Published
- 2013
10. Erratum: Germline mutations affecting the proofreading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas (Nature Genetics (2013) 45 (136-144))
- Author
-
Palles, C, Cazier, J-B, Howarth, KM, Domingo, E, Jones, AM, Broderick, P, Kemp, Z, Spain, SL, Almeida, EG, Salguero, I, Sherborne, A, Chubb, D, Carvajal-Carmona, LG, Ma, Y, Kaur, K, Dobbins, S, Barclay, E, Gorman, M, Martin, L, Kovac, MB, Humphray, S, Lucassen, A, Holmes, CC, Bentley, D, Donnelly, P, Taylor, J, Petridis, C, Roylance, R, Sawyer, EJ, Kerr, DJ, Clark, S, Grimes, J, Kearsey, SE, Thomas, HJW, McVean, G, Houlston, RS, and Tomlinson, I
- Published
- 2013
11. Statistical Inference in Hidden Markov Models using $k$-segment Constraints
- Author
-
Titsias, MK, Holmes, CC, and Yau, C
- Subjects
Methodology (stat.ME) ,FOS: Computer and information sciences ,Computer Science - Learning ,Theory and Methods ,Statistics - Machine Learning ,Segmentation ,Machine Learning (stat.ML) ,Hidden Markov models ,Dynamic programming ,Statistics - Methodology ,Article ,Machine Learning (cs.LG) - Abstract
Hidden Markov models (HMMs) are one of the most widely used statistical methods for analyzing sequence data. However, the reporting of output from HMMs has largely been restricted to the presentation of the most-probable (MAP) hidden state sequence, found via the Viterbi algorithm, or the sequence of most probable marginals using the forward-backward (F-B) algorithm. In this article, we expand the amount of information we could obtain from the posterior distribution of an HMM by introducing linear-time dynamic programming algorithms that, we collectively call $k$-segment algorithms, that allow us to i) find MAP sequences, ii) compute posterior probabilities and iii) simulate sample paths conditional on a user specified number of segments, i.e. contiguous runs in a hidden state, possibly of a particular type. We illustrate the utility of these methods using simulated and real examples and highlight the application of prospective and retrospective use of these methods for fitting HMMs or exploring existing model fits., Comment: 37 pages
- Published
- 2013
- Full Text
- View/download PDF
12. Erratum: Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution (Nature Genetics (2010) 42 (949-960))
- Author
-
Heid, IM, Jackson, AU, Randall, JC, Winkler, TW, Qi, L, Steinthorsdottir, V, Thorleifsson, G, Zillikens, MC, Speliotes, EK, Mägi, R, Workalemahu, T, White, CC, Bouatia-Naji, N, Harris, TB, Berndt, SI, Ingelsson, E, Willer, CJ, Weedon, MN, Luan, J, Vedantam, S, Esko, T, Kilpeläinen, TO, Kutalik, Z, Li, S, Monda, KL, Dixon, AL, Holmes, CC, Kaplan, LM, Liang, L, Min, JL, Moffatt, MF, Molony, C, Nicholson, G, Schadt, EE, Zondervan, KT, Feitosa, MF, Ferreira, T, Allen, HL, Weyant, RJ, Wheeler, E, Wood, AR, Estrada, K, Goddard, ME, Lettre, G, Mangino, M, Nyholt, DR, Purcell, S, Vernon Smith, A, Visscher, PM, Yang, J, McCarroll, SA, Nemesh, J, Voight, BF, Absher, D, Amin, N, Aspelund, T, Coin, L, Glazer, NL, Hayward, C, Heard-Costa, NL, Hottenga, J-J, Johansson, A, Johnson, T, Kaakinen, M, Kapur, K, Ketkar, S, Knowles, JW, Kraft, P, Kraja, AT, Lamina, C, Leitzmann, MF, McKnight, B, Morris, AP, Ong, KK, Perry, JRB, Peters, MJ, Polasek, O, Prokopenko, I, Rayner, NW, Ripatti, S, Rivadeneira, F, Robertson, NR, Sanna, S, Sovio, U, Surakka, I, Teumer, A, Van Wingerden, S, Vitart, V, Zhao, JH, Cavalcanti-Proença, C, Chines, PS, Fisher, E, Kulzer, JR, Lecoeur, C, Narisu, N, Sandholt, C, Scott, LJ, Silander, K, Stark, K, Tammesoo, M-L, Teslovich, TM, Timpson, NJ, Watanabe, RM, Welch, R, Chasman, DI, Cooper, MN, Jansson, J-O, Kettunen, J, Lawrence, RW, Pellikka, N, Perola, M, Vandenput, L, Alavere, H, Almgren, P, Atwood, LD, Bennett, AJ, Biffar, R, Bonnycastle, LL, Bornstein, SR, Buchanan, TA, Campbell, H, Day, INM, Dei, M, Dörr, M, Elliott, P, Erdos, MR, Eriksson, JG, Freimer, NB, Fu, M, Gaget, S, Geus, EJC, Gjesing, AP, Grallert, H, Gräßler, J, Groves, CJ, Guiducci, C, Hartikainen, A-L, Hassanali, N, Havulinna, AS, Herzig, K-H, Hicks, AA, Hui, J, Igl, W, Jousilahti, P, Jula, A, Kajantie, E, Kinnunen, L, Kolcic, I, Koskinen, S, Kovacs, P, Kroemer, HK, Krzelj, V, Kuusisto, J, Kvaloy, K, Laitinen, J, Lantieri, O, Lathrop, GM, Lokki, M-L, Luben, RN, Ludwig, B, McArdle, WL, McCarthy, A, Morken, MA, Nelis, M, Neville, MJ, Paré, G, Parker, AN, Peden, JF, Pichler, I, Pietiläinen, KH, Platou, CGP, Pouta, A, Ridderstråle, M, Samani, NJ, Saramies, J, Sinisalo, J, Smit, JH, Strawbridge, RJ, Stringham, HM, Swift, AJ, Teder-Laving, M, Thomson, B, Usala, G, Van Meurs, JBJ, Van Ommen, G-J, Vatin, V, Volpato, CB, Wallaschofski, H, Walters, GB, Widen, E, Wild, SH, Willemsen, G, Witte, DR, Zgaga, L, Zitting, P, Beilby, JP, James, AL, Kähönen, M, Lehtimäki, T, Nieminen, MS, Ohlsson, C, Palmer, LJ, Raitakari, O, Ridker, PM, Stumvoll, M, Tönjes, A, Viikari, J, Balkau, B, Ben-Shlomo, Y, Bergman, RN, Boeing, H, Smith, GD, Ebrahim, S, Froguel, P, Hansen, T, Hengstenberg, C, Hveem, K, Isomaa, B, Jørgensen, T, Karpe, F, Khaw, K-T, Laakso, M, Lawlor, DA, Marre, M, Meitinger, T, Metspalu, A, Midthjell, K, Pedersen, O, Salomaa, V, Schwarz, PEH, Tuomi, T, Tuomilehto, J, Valle, TT, Wareham, NJ, Arnold, AM, Beckmann, JS, Bergmann, S, Boerwinkle, E, Boomsma, DI, Caulfield, MJ, Collins, FS, Eiriksdottir, G, Gudnason, V, Gyllensten, U, Hamsten, A, Hattersley, AT, Hofman, A, Hu, FB, Illig, T, Iribarren, C, Jarvelin, M-R, Kao, WHL, Kaprio, J, Launer, LJ, Munroe, PB, Oostra, B, Penninx, BW, Pramstaller, PP, Psaty, BM, Quertermous, T, Rissanen, A, Rudan, I, Shuldiner, AR, Soranzo, N, Spector, TD, Syvanen, A-C, Uda, M, Uitterlinden, A, Völzke, H, Vollenweider, P, Wilson, JF, Witteman, JC, Wright, AF, Abecasis, GR, Boehnke, M, Borecki, IB, Deloukas, P, Frayling, TM, Groop, LC, Haritunians, T, Hunter, DJ, Kaplan, RC, North, KE, O'Connell, JR, Peltonen, L, Schlessinger, D, Strachan, DP, Hirschhorn, JN, Assimes, TL, Wichmann, H-E, Thorsteinsdottir, U, Van Duijn, CM, Stefansson, K, Cupples, LA, Loos, RJF, Barroso, I, McCarthy, MI, Fox, CS, Mohlke, KL, and Lindgren, CM
- Published
- 2011
13. Estimation of malaria haplotype and genotype frequencies: a statistical approach to overcome the challenge associated with multiclonal infections
- Author
-
Taylor, AR, Flegg, JA, Nsobya, SL, Yeka, A, Kamya, MR, Rosenthal, PJ, Dorsey, G, Sibley, CH, Guerin, PJ, Holmes, CC, Taylor, AR, Flegg, JA, Nsobya, SL, Yeka, A, Kamya, MR, Rosenthal, PJ, Dorsey, G, Sibley, CH, Guerin, PJ, and Holmes, CC
- Abstract
BACKGROUND: Reliable measures of anti-malarial resistance are crucial for malaria control. Resistance is typically a complex trait: multiple mutations in a single parasite (a haplotype or genotype) are necessary for elaboration of the resistant phenotype. The frequency of a genetic motif (proportion of parasite clones in the parasite population that carry a given allele, haplotype or genotype) is a useful measure of resistance. In areas of high endemicity, malaria patients generally harbour multiple parasite clones; they have multiplicities of infection (MOIs) greater than one. However, most standard experimental procedures only allow measurement of marker prevalence (proportion of patient blood samples that test positive for a given mutation or combination of mutations), not frequency. It is misleading to compare marker prevalence between sites that have different mean MOIs; frequencies are required instead. METHODS: A Bayesian statistical model was developed to estimate Plasmodium falciparum genetic motif frequencies from prevalence data collected in the field. To assess model performance and computational speed, a detailed simulation study was implemented. Application of the model was tested using datasets from five sites in Uganda. The datasets included prevalence data on markers of resistance to sulphadoxine-pyrimethamine and an average MOI estimate for each study site. RESULTS: The simulation study revealed that the genetic motif frequencies that were estimated using the model were more accurate and precise than conventional estimates based on direct counting. Importantly, the model did not require measurements of the MOI in each patient; it used the average MOI in the patient population. Furthermore, if a dataset included partially genotyped patient blood samples, the model imputed the data that were missing. Using the model and the Ugandan data, genotype frequencies were estimated and four biologically relevant genotypes were identified. CONCLUSIONS: The mod
- Published
- 2014
14. A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data
- Author
-
Yau, C, Mouradov, D, Jorissen, RN, Colella, S, Mirza, G, Steers, G, Harris, A, Ragoussis, J, Sieber, O, Holmes, CC, Yau, C, Mouradov, D, Jorissen, RN, Colella, S, Mirza, G, Steers, G, Harris, A, Ragoussis, J, Sieber, O, and Holmes, CC
- Abstract
We describe a statistical method for the characterization of genomic aberrations in single nucleotide polymorphism microarray data acquired from cancer genomes. Our approach allows us to model the joint effect of polyploidy, normal DNA contamination and intra-tumour heterogeneity within a single unified Bayesian framework. We demonstrate the efficacy of our method on numerous datasets including laboratory generated mixtures of normal-cancer cell lines and real primary tumours.
- Published
- 2010
15. Model-based geostatistics - Discussion
- Author
-
Webster, R., Lawson, Ab, Glasbey, C., Horgan, G., Elston, D., Host, G., Mugglestone, Ma, Kenward, Mg, Kent, Jt, Stein, A., Clifford, P., Ledford, Aw, Marriott, Pk, Aitkin, M., Atkinson, Ac, Boskov, M., Kelsall, J., Wakefield, J., Bowman, A., Casson, E., Cressie, N., Denison, Dgt, Mallick, Bk, Dixon, P., Scott, M., Haas, Tc, Handcock, Ms, Holmes, Cc, Laslett, G., Lele, S., Nadarajah, S., Anthony O'Hagan, Pettitt, An, Hay, J., Richardson, S., Stein, M., Stoyan, D., and Williams, Cki
- Published
- 1998
16. Statistical inference with exchangeability and martingales.
- Author
-
Holmes CC and Walker SG
- Abstract
In this paper, we start by reviewing exchangeability and its relevance to the Bayesian approach. We highlight the predictive nature of Bayesian models and the symmetry assumptions implied by beliefs of an underlying exchangeable sequence of observations. By taking a closer look at the Bayesian bootstrap, the parametric bootstrap of Efron and a version of Bayesian thinking about inference uncovered by Doob based on martingales, we introduce a parametric Bayesian bootstrap. Martingales play a fundamental role. Illustrations are presented as is the relevant theory. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
- Published
- 2023
- Full Text
- View/download PDF
17. Improving statistical power in severe malaria genetic association studies by augmenting phenotypic precision.
- Author
-
Watson JA, Ndila CM, Uyoga S, Macharia A, Nyutu G, Mohammed S, Ngetsa C, Mturi N, Peshu N, Tsofa B, Rockett K, Leopold S, Kingston H, George EC, Maitland K, Day NP, Dondorp AM, Bejon P, Williams TN, Holmes CC, and White NJ
- Subjects
- Adolescent, Adult, Case-Control Studies, Child, Child, Preschool, Extracellular Matrix Proteins genetics, Female, Genomics, Humans, Kenya, Malaria, Falciparum, Male, Polymorphism, Genetic, Genome-Wide Association Study, Malaria diagnosis, Malaria epidemiology, Phenotype
- Abstract
Severe falciparum malaria has substantially affected human evolution. Genetic association studies of patients with clinically defined severe malaria and matched population controls have helped characterise human genetic susceptibility to severe malaria, but phenotypic imprecision compromises discovered associations. In areas of high malaria transmission, the diagnosis of severe malaria in young children and, in particular, the distinction from bacterial sepsis are imprecise. We developed a probabilistic diagnostic model of severe malaria using platelet and white count data. Under this model, we re-analysed clinical and genetic data from 2220 Kenyan children with clinically defined severe malaria and 3940 population controls, adjusting for phenotype mis-labelling. Our model, validated by the distribution of sickle trait, estimated that approximately one-third of cases did not have severe malaria. We propose a data-tilting approach for case-control studies with phenotype mis-labelling and show that this reduces false discovery rates and improves statistical power in genome-wide association studies., Competing Interests: JW, CN, SU, AM, GN, SM, CN, NM, NP, BT, KR, SL, HK, EG, KM, ND, AD, PB, TW, CH, NW No competing interests declared, (© 2021, Watson et al.)
- Published
- 2021
- Full Text
- View/download PDF
18. Correction to: Graphing and reporting heterogeneous treatment effects through reference classes.
- Author
-
Watson JA and Holmes CC
- Abstract
An amendment to this paper has been published and can be accessed via the original article.
- Published
- 2020
- Full Text
- View/download PDF
19. A cautionary note on the use of unsupervised machine learning algorithms to characterise malaria parasite population structure from genetic distance matrices.
- Author
-
Watson JA, Taylor AR, Ashley EA, Dondorp A, Buckee CO, White NJ, and Holmes CC
- Subjects
- Algorithms, Antimalarials therapeutic use, Cambodia epidemiology, Cluster Analysis, Drug Resistance genetics, Genotype, Humans, Malaria, Falciparum drug therapy, Malaria, Falciparum genetics, Malaria, Falciparum parasitology, Plasmodium falciparum pathogenicity, Unsupervised Machine Learning, Genetics, Population statistics & numerical data, Malaria, Falciparum epidemiology, Molecular Epidemiology, Plasmodium falciparum genetics
- Abstract
Genetic surveillance of malaria parasites supports malaria control programmes, treatment guidelines and elimination strategies. Surveillance studies often pose questions about malaria parasite ancestry (e.g. how antimalarial resistance has spread) and employ statistical methods that characterise parasite population structure. Many of the methods used to characterise structure are unsupervised machine learning algorithms which depend on a genetic distance matrix, notably principal coordinates analysis (PCoA) and hierarchical agglomerative clustering (HAC). PCoA and HAC are sensitive to both the definition of genetic distance and algorithmic specification. Importantly, neither algorithm infers malaria parasite ancestry. As such, PCoA and HAC can inform (e.g. via exploratory data visualisation and hypothesis generation), but not answer comprehensively, key questions about malaria parasite ancestry. We illustrate the sensitivity of PCoA and HAC using 393 Plasmodium falciparum whole genome sequences collected from Cambodia and neighbouring regions (where antimalarial resistance has emerged and spread recently) and we provide tentative guidance for the use and interpretation of PCoA and HAC in malaria parasite genetic epidemiology. This guidance includes a call for fully transparent and reproducible analysis pipelines that feature (i) a clearly outlined scientific question; (ii) a clear justification of analytical methods used to answer the scientific question along with discussion of any inferential limitations; (iii) publicly available genetic distance matrices when downstream analyses depend on them; and (iv) sensitivity analyses. To bridge the inferential disconnect between the output of non-inferential unsupervised learning algorithms and the scientific questions of interest, tailor-made statistical models are needed to infer malaria parasite ancestry. In the absence of such models speculative reasoning should feature only as discussion but not as results., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
20. Graphing and reporting heterogeneous treatment effects through reference classes.
- Author
-
Watson JA and Holmes CC
- Subjects
- Algorithms, Humans, Malaria, Falciparum mortality, Mortality, Outcome Assessment, Health Care, Precision Medicine, Predictive Value of Tests, Randomized Controlled Trials as Topic, Risk Reduction Behavior, Therapeutic Uses, Forecasting methods, Malaria, Falciparum therapy, Research Design trends
- Abstract
Background: Exploration and modelling of heterogeneous treatment effects as a function of baseline covariates is an important aspect of precision medicine in randomised controlled trials (RCTs). Randomisation generally guarantees the internal validity of an RCT, but heterogeneity in treatment effect can reduce external validity. Estimation of heterogeneous treatment effects is usually done via a predictive model for individual outcomes, where one searches for interactions between treatment allocation and important patient baseline covariates. However, such models are prone to overfitting and multiple testing and typically demand a transformation of the outcome measurement, for example, from the absolute risk in the original RCT to log-odds of risk in the predictive model., Methods: We show how reference classes derived from baseline covariates can be used to explore heterogeneous treatment effects via a two-stage approach. We first estimate a risk score which captures on a single dimension some of the heterogeneity in outcomes of the trial population. Heterogeneity in the treatment effect can then be explored via reweighting schemes along this axis of variation. This two-stage approach bypasses the search for interactions with multiple covariates, thus protecting against multiple testing. It also allows for exploration of heterogeneous treatment effects on the original outcome scale of the RCT. This approach would typically be applied to multivariable models of baseline risk to assess the stability of average treatment effects with respect to the distribution of risk in the population studied., Case Study: We illustrate this approach using the single largest randomised treatment trial in severe falciparum malaria and demonstrate how the estimated treatment effect in terms of absolute mortality risk reduction increases considerably in higher risk strata., Conclusions: 'Local' and 'tilting' reweighting schemes based on ranking patients by baseline risk can be used as a general approach for exploring, graphing and reporting heterogeneity of treatment effect in RCTs., Trial Registration: ISRCTN clinical trials registry: ISRCTN50258054. Prospectively registered on 22 July 2005.
- Published
- 2020
- Full Text
- View/download PDF
21. The value of early physical maturity to young adult labor market outcomes.
- Author
-
Holmes CC, Tracey MR, and Belasen AR
- Subjects
- Adolescent, Adult, Beauty, Body Height, Female, Humans, Longitudinal Studies, Male, Occupations, Young Adult, Employment statistics & numerical data, Income statistics & numerical data, Puberty physiology
- Abstract
Puberty is the most important developmental milestone closely preceding a young adult's labor market decisions. Thus, we examine the variation in the timing of physical maturity during adolescence to isolate its association with employment and hourly wages for US young adults. Using the National Longitudinal Study of Adolescent to Adult Health data, we find an early maturity premium of about 6% for females and 8% for males, but no employment advantage, in excess of gains from height and physical attractiveness. Cognitive and personality factors significantly explain this premium for both genders, but job attributes are also important for males., (Copyright © 2020 Elsevier B.V. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
22. Machine learning analysis plans for randomised controlled trials: detecting treatment effect heterogeneity with strict control of type I error.
- Author
-
Watson JA and Holmes CC
- Subjects
- Adult, Algorithms, Asia epidemiology, Humans, Malaria, Falciparum epidemiology, Malaria, Falciparum parasitology, Retrospective Studies, Treatment Outcome, Antimalarials therapeutic use, Artesunate therapeutic use, Machine Learning, Malaria, Falciparum drug therapy, Plasmodium falciparum drug effects, Quinine therapeutic use, Randomized Controlled Trials as Topic
- Abstract
Background: Retrospective exploratory analyses of randomised controlled trials (RCTs) seeking to identify treatment effect heterogeneity (TEH) are prone to bias and false positives. Yet the desire to learn all we can from exhaustive data measurements on trial participants motivates the inclusion of such analyses within RCTs. Moreover, widespread advances in machine learning (ML) methods hold potential to utilise such data to identify subjects exhibiting heterogeneous treatment response., Methods: We present a novel analysis strategy for detecting TEH in randomised data using ML methods, whilst ensuring proper control of the false positive discovery rate. Our approach uses random data partitioning with statistical or ML-based prediction on held-out data. This method can test for both crossover TEH (switch in optimal treatment) and non-crossover TEH (systematic variation in benefit across patients). The former is done via a two-sample hypothesis test measuring overall predictive performance. The latter is done via 'stacking' the ML predictors alongside a classical statistical model to formally test the added benefit of the ML algorithm. An adaptation of recent statistical theory allows for the construction of a valid aggregate p value. This testing strategy is independent of the choice of ML method., Results: We demonstrate our approach with a re-analysis of the SEAQUAMAT trial, which compared quinine to artesunate for the treatment of severe malaria in Asian adults. We find no evidence for any subgroup who would benefit from a change in treatment from the current standard of care, artesunate, but strong evidence for significant TEH within the artesunate treatment group. In particular, we find that artesunate provides a differential benefit to patients with high numbers of circulating ring stage parasites., Conclusions: ML analysis plans using computational notebooks (documents linked to a programming language that capture the model parameter settings, data processing choices, and evaluation criteria) along with version control can improve the robustness and transparency of RCT exploratory analyses. A data-partitioning algorithm allows researchers to apply the latest ML techniques safe in the knowledge that any declared associations are statistically significant at a user-defined level.
- Published
- 2020
- Full Text
- View/download PDF
23. Transcriptome Deconvolution of Heterogeneous Tumor Samples with Immune Infiltration.
- Author
-
Wang Z, Cao S, Morris JS, Ahn J, Liu R, Tyekucheva S, Gao F, Li B, Lu W, Tang X, Wistuba II, Bowden M, Mucci L, Loda M, Parmigiani G, Holmes CC, and Wang W
- Abstract
Transcriptome deconvolution in cancer and other heterogeneous tissues remains challenging. Available methods lack the ability to estimate both component-specific proportions and expression profiles for individual samples. We present DeMixT, a new tool to deconvolve high-dimensional data from mixtures of more than two components. DeMixT implements an iterated conditional mode algorithm and a novel gene-set-based component merging approach to improve accuracy. In a series of experimental validation studies and application to TCGA data, DeMixT showed high accuracy. Improved deconvolution is an important step toward linking tumor transcriptomic data with clinical outcomes. An R package, scripts, and data are available: https://github.com/wwylab/DeMixTallmaterials., (Published by Elsevier Inc.)
- Published
- 2018
- Full Text
- View/download PDF
24. High-throughput mouse phenomics for characterizing mammalian gene function.
- Author
-
Brown SDM, Holmes CC, Mallon AM, Meehan TF, Smedley D, and Wells S
- Subjects
- Animals, Humans, Mice, Databases, Genetic, Genetic Variation, Genome, Genomics methods
- Abstract
We are entering a new era of mouse phenomics, driven by large-scale and economical generation of mouse mutants coupled with increasingly sophisticated and comprehensive phenotyping. These studies are generating large, multidimensional gene-phenotype data sets, which are shedding new light on the mammalian genome landscape and revealing many hitherto unknown features of mammalian gene function. Moreover, these phenome resources provide a wealth of disease models and can be integrated with human genomics data as a powerful approach for the interpretation of human genetic variation and its relationship to disease. In the future, the development of novel phenotyping platforms allied to improved computational approaches, including machine learning, for the analysis of phenotype data will continue to enhance our ability to develop a comprehensive and powerful model of mammalian gene-phenotype space.
- Published
- 2018
- Full Text
- View/download PDF
25. Scalable Bayesian nonparametric measures for exploring pairwise dependence via Dirichlet Process Mixtures.
- Author
-
Filippi S, Holmes CC, and Nieto-Barajas LE
- Abstract
In this article we propose novel Bayesian nonparametric methods using Dirichlet Process Mixture (DPM) models for detecting pairwise dependence between random variables while accounting for uncertainty in the form of the underlying distributions. A key criteria is that the procedures should scale to large data sets. In this regard we find that the formal calculation of the Bayes factor for a dependent-vs.-independent DPM joint probability measure is not feasible computationally. To address this we present Bayesian diagnostic measures for characterising evidence against a "null model" of pairwise independence. In simulation studies, as well as for a real data analysis, we show that our approach provides a useful tool for the exploratory nonparametric Bayesian analysis of large multivariate data sets.
- Published
- 2016
- Full Text
- View/download PDF
26. A general framework for updating belief distributions.
- Author
-
Bissiri PG, Holmes CC, and Walker SG
- Abstract
We propose a framework for general Bayesian inference. We argue that a valid update of a prior belief distribution to a posterior can be made for parameters which are connected to observations through a loss function rather than the traditional likelihood function, which is recovered as a special case. Modern application areas make it increasingly challenging for Bayesians to attempt to model the true data-generating mechanism. For instance, when the object of interest is low dimensional, such as a mean or median, it is cumbersome to have to achieve this via a complete model for the whole data distribution. More importantly, there are settings where the parameter of interest does not directly index a family of density functions and thus the Bayesian approach to learning about such parameters is currently regarded as problematic. Our framework uses loss functions to connect information in the data to functionals of interest. The updating of beliefs then follows from a decision theoretic approach involving cumulative loss functions. Importantly, the procedure coincides with Bayesian updating when a true likelihood is known yet provides coherent subjective inference in much more general settings. Connections to other inference frameworks are highlighted.
- Published
- 2016
- Full Text
- View/download PDF
27. Artemether-Lumefantrine and Dihydroartemisinin-Piperaquine Exert Inverse Selective Pressure on Plasmodium Falciparum Drug Sensitivity-Associated Haplotypes in Uganda.
- Author
-
Taylor AR, Flegg JA, Holmes CC, Guérin PJ, Sibley CH, Conrad MD, Dorsey G, and Rosenthal PJ
- Abstract
Background: Altered sensitivity to multiple antimalarial drugs is mediated by polymorphisms in pfmdr1 , which encodes the Plasmodium falciparum multidrug resistance transporter. In Africa the N86Y and D1246Y polymorphisms have been shown to be selected by treatment, with artemether-lumefantrine (AL) and dihydroartemisinin-piperaquine (DP) selecting for wild-type and mutant alleles, respectively. However, there has been little study of pfmdr1 haplotypes, in part because haplotype analyses are complicated by multiclonal infections., Methods: We fit a haplotype frequency estimation model, which accounts for multiclonal infections, to the polymorphic pfmdr1 N86Y, Y184F, and D1246Y alleles in samples from a longitudinal trial comparing AL and DP to treat uncomplicated P falciparum malaria in Tororo, Uganda from 2007 to 2012. We regressed estimates onto covariates of trial arm and selective drug pressure., Results: Yearly trends showed increasing frequency estimates for haplotypes with wild type pfmdr1 N86 and D1246 alleles and decreasing frequency estimates for haplotypes with the mutant pfmdr1 86Y allele. Considering days since prior therapy, we saw evidence suggestive of selection by AL for haplotypes with N86 combined with 184F, D1246, or both, and against all haplotypes with 86Y, and evidence suggestive of selection by DP for 86Y only when combined with Y184 and 1246Y (haplotype YYY) and against haplotypes NFD and NYY., Conclusions: Based on our model, AL selected several haplotypes containing N86, whereas DP selection was haplotype specific, demonstrating the importance of haplotype analyses. Inverse selective pressure of AL and DP on the complementary haplotypes NFD and YYY suggests that rotating artemisinin-based antimalarial combination regimens may be the best treatment option to prevent resistance selection.
- Published
- 2016
- Full Text
- View/download PDF
28. Scalable Bayesian nonparametric regression via a Plackett-Luce model for conditional ranks.
- Author
-
Gray-Davies T, Holmes CC, and Caron F
- Abstract
We present a novel Bayesian nonparametric regression model for covariates X and continuous response variable Y ∈ ℝ. The model is parametrized in terms of marginal distributions for Y and X and a regression function which tunes the stochastic ordering of the conditional distributions F ( y|x ). By adopting an approximate composite likelihood approach, we show that the resulting posterior inference can be decoupled for the separate components of the model. This procedure can scale to very large datasets and allows for the use of standard, existing, software from Bayesian nonparametric density estimation and Plackett-Luce ranking estimation to be applied. As an illustration, we show an application of our approach to a US Census dataset, with over 1,300,000 data points and more than 100 covariates.
- Published
- 2016
- Full Text
- View/download PDF
29. Statistical Inference in Hidden Markov Models Using k -Segment Constraints.
- Author
-
Titsias MK, Holmes CC, and Yau C
- Abstract
Hidden Markov models (HMMs) are one of the most widely used statistical methods for analyzing sequence data. However, the reporting of output from HMMs has largely been restricted to the presentation of the most-probable (MAP) hidden state sequence, found via the Viterbi algorithm, or the sequence of most probable marginals using the forward-backward algorithm. In this article, we expand the amount of information we could obtain from the posterior distribution of an HMM by introducing linear-time dynamic programming recursions that, conditional on a user-specified constraint in the number of segments, allow us to (i) find MAP sequences, (ii) compute posterior probabilities, and (iii) simulate sample paths. We collectively call these recursions k -segment algorithms and illustrate their utility using simulated and real examples. We also highlight the prospective and retrospective use of k -segment constraints for fitting HMMs or exploring existing model fits. Supplementary materials for this article are available online.
- Published
- 2016
- Full Text
- View/download PDF
30. Robust Linear Models for Cis-eQTL Analysis.
- Author
-
Rantalainen M, Lindgren CM, and Holmes CC
- Subjects
- Databases, Genetic, Gene Expression, Humans, Polymorphism, Single Nucleotide, Linear Models, Quantitative Trait Loci
- Abstract
Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates, assuming an allelic dosage model and a Gaussian error term. However, gene expression data generally have noise that induces heavy-tailed errors relative to the Gaussian distribution and often include atypical observations, or outliers. Such departures from modelling assumptions can lead to an increased rate of type II errors (false negatives), and to some extent also type I errors (false positives). Careful model checking can reduce the risk of type-I errors but often not type II errors, since it is generally too time-consuming to carefully check all models with a non-significant effect in large-scale and genome-wide studies. Here we propose the application of a robust linear model for eQTL analysis to reduce adverse effects of deviations from the assumption of Gaussian residuals. We present results from a simulation study as well as results from the analysis of real eQTL data sets. Our findings suggest that in many situations robust models have the potential to provide more reliable eQTL results compared to conventional linear models, particularly in respect to reducing type II errors due to non-Gaussian noise. Post-genomic data, such as that generated in genome-wide eQTL studies, are often noisy and frequently contain atypical observations. Robust statistical models have the potential to provide more reliable results and increased statistical power under non-Gaussian conditions. The results presented here suggest that robust models should be considered routinely alongside other commonly used methodologies for eQTL analysis.
- Published
- 2015
- Full Text
- View/download PDF
31. Distinct developmental profile of lower-body adipose tissue defines resistance against obesity-associated metabolic complications.
- Author
-
Pinnick KE, Nicholson G, Manolopoulos KN, McQuaid SE, Valet P, Frayn KN, Denton N, Min JL, Zondervan KT, Fleckner J, McCarthy MI, Holmes CC, and Karpe F
- Subjects
- Abdominal Fat metabolism, Adult, DNA Methylation, Female, Humans, Intra-Abdominal Fat metabolism, Male, Middle Aged, Risk Factors, Subcutaneous Fat, Abdominal metabolism, T-Box Domain Proteins metabolism, Adipose Tissue metabolism, Cardiovascular Diseases metabolism, Obesity metabolism
- Abstract
Upper- and lower-body fat depots exhibit opposing associations with obesity-related metabolic disease. We defined the relationship between DEXA-quantified fat depots and diabetes/cardiovascular risk factors in a healthy population-based cohort (n = 3,399). Gynoid fat mass correlated negatively with insulin resistance after total fat mass adjustment, whereas the opposite was seen for abdominal fat. Paired transcriptomic analysis of gluteal subcutaneous adipose tissue (GSAT) and abdominal subcutaneous adipose tissue (ASAT) was performed across the BMI spectrum (n = 49; 21.4-45.5 kg/m(2)). In both depots, energy-generating metabolic genes were negatively associated and inflammatory genes were positively associated with obesity. However, associations were significantly weaker in GSAT. At the systemic level, arteriovenous release of the proinflammatory cytokine interleukin-6 (n = 34) was lower from GSAT than ASAT. Isolated preadipocytes retained a depot-specific transcriptional "memory" of embryonic developmental genes and exhibited differential promoter DNA methylation of selected genes (HOTAIR, TBX5) between GSAT and ASAT. Short hairpin RNA-mediated silencing identified TBX5 as a regulator of preadipocyte proliferation and adipogenic differentiation in ASAT. In conclusion, intrinsic differences in the expression of developmental genes in regional adipocytes provide a mechanistic basis for diversity in adipose tissue (AT) function. The less inflammatory nature of lower-body AT offers insight into the opposing metabolic disease risk associations between upper- and lower-body obesity., (© 2014 by the American Diabetes Association. Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered.)
- Published
- 2014
- Full Text
- View/download PDF
32. Estimation of malaria haplotype and genotype frequencies: a statistical approach to overcome the challenge associated with multiclonal infections.
- Author
-
Taylor AR, Flegg JA, Nsobya SL, Yeka A, Kamya MR, Rosenthal PJ, Dorsey G, Sibley CH, Guerin PJ, and Holmes CC
- Subjects
- Genotype, Haplotypes, Humans, Malaria, Falciparum epidemiology, Models, Statistical, Plasmodium falciparum classification, Plasmodium falciparum drug effects, Prevalence, Uganda, Drug Resistance, Gene Frequency, Malaria, Falciparum parasitology, Plasmodium falciparum genetics, Plasmodium falciparum isolation & purification
- Abstract
Background: Reliable measures of anti-malarial resistance are crucial for malaria control. Resistance is typically a complex trait: multiple mutations in a single parasite (a haplotype or genotype) are necessary for elaboration of the resistant phenotype. The frequency of a genetic motif (proportion of parasite clones in the parasite population that carry a given allele, haplotype or genotype) is a useful measure of resistance. In areas of high endemicity, malaria patients generally harbour multiple parasite clones; they have multiplicities of infection (MOIs) greater than one. However, most standard experimental procedures only allow measurement of marker prevalence (proportion of patient blood samples that test positive for a given mutation or combination of mutations), not frequency. It is misleading to compare marker prevalence between sites that have different mean MOIs; frequencies are required instead., Methods: A Bayesian statistical model was developed to estimate Plasmodium falciparum genetic motif frequencies from prevalence data collected in the field. To assess model performance and computational speed, a detailed simulation study was implemented. Application of the model was tested using datasets from five sites in Uganda. The datasets included prevalence data on markers of resistance to sulphadoxine-pyrimethamine and an average MOI estimate for each study site., Results: The simulation study revealed that the genetic motif frequencies that were estimated using the model were more accurate and precise than conventional estimates based on direct counting. Importantly, the model did not require measurements of the MOI in each patient; it used the average MOI in the patient population. Furthermore, if a dataset included partially genotyped patient blood samples, the model imputed the data that were missing. Using the model and the Ugandan data, genotype frequencies were estimated and four biologically relevant genotypes were identified., Conclusions: The model allows fast, accurate, reliable estimation of the frequency of genetic motifs associated with resistance to anti-malarials using prevalence data collected from malaria patients. The model does not require per-patient MOI measurements and can easily analyse data from five markers. The model will be a valuable tool for monitoring markers of anti-malarial drug resistance, including markers of resistance to artemisinin derivatives and partner drugs.
- Published
- 2014
- Full Text
- View/download PDF
33. NucleoFinder: a statistical approach for the detection of nucleosome positions.
- Author
-
Becker J, Yau C, Hancock JM, and Holmes CC
- Subjects
- Cell Line, Genome, Human, High-Throughput Nucleotide Sequencing, Humans, Sequence Analysis, DNA, Models, Statistical, Nucleosomes chemistry
- Abstract
Motivation: The identification of nucleosomes along the chromatin is key to understanding their role in the regulation of gene expression and other DNA-related processes. However, current experimental methods (MNase-ChIP, MNase-Seq) sample nucleosome positions from a cell population and contain biases, making thus the precise identification of individual nucleosomes not straightforward. Recent works have only focused on the first point, where noise reduction approaches have been developed to identify nucleosome positions., Results: In this article, we propose a new approach, termed NucleoFinder, that addresses both the positional heterogeneity across cells and experimental biases by seeking nucleosomes consistently positioned in a cell population and showing a significant enrichment relative to a control sample. Despite the absence of validated dataset, we show that our approach (i) detects fewer false positives than two other nucleosome calling methods and (ii) identifies two important features of the nucleosome organization (the nucleosome spacing downstream of active promoters and the enrichment/depletion of GC/AT dinucleotides at the centre of in vitro nucleosomes) with equal or greater ability than the other two methods.
- Published
- 2013
- Full Text
- View/download PDF
34. Germline mutations affecting the proofreading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas.
- Author
-
Palles C, Cazier JB, Howarth KM, Domingo E, Jones AM, Broderick P, Kemp Z, Spain SL, Guarino E, Salguero I, Sherborne A, Chubb D, Carvajal-Carmona LG, Ma Y, Kaur K, Dobbins S, Barclay E, Gorman M, Martin L, Kovac MB, Humphray S, Lucassen A, Holmes CC, Bentley D, Donnelly P, Taylor J, Petridis C, Roylance R, Sawyer EJ, Kerr DJ, Clark S, Grimes J, Kearsey SE, Thomas HJ, McVean G, Houlston RS, and Tomlinson I
- Subjects
- Exodeoxyribonucleases genetics, Genetic Linkage, Genome-Wide Association Study, Germ-Line Mutation genetics, Humans, Microsatellite Repeats genetics, Pedigree, Poly-ADP-Ribose Binding Proteins, Schizosaccharomyces genetics, Sequence Analysis, DNA, Adenoma genetics, Colorectal Neoplasms genetics, DNA Mismatch Repair genetics, DNA Polymerase II genetics, DNA Polymerase III genetics, DNA Replication genetics, Models, Molecular
- Abstract
Many individuals with multiple or large colorectal adenomas or early-onset colorectal cancer (CRC) have no detectable germline mutations in the known cancer predisposition genes. Using whole-genome sequencing, supplemented by linkage and association analysis, we identified specific heterozygous POLE or POLD1 germline variants in several multiple-adenoma and/or CRC cases but in no controls. The variants associated with susceptibility, POLE p.Leu424Val and POLD1 p.Ser478Asn, have high penetrance, and POLD1 mutation was also associated with endometrial cancer predisposition. The mutations map to equivalent sites in the proofreading (exonuclease) domain of DNA polymerases ɛ and δ and are predicted to cause a defect in the correction of mispaired bases inserted during DNA replication. In agreement with this prediction, the tumors from mutation carriers were microsatellite stable but tended to acquire base substitution mutations, as confirmed by yeast functional assays. Further analysis of published data showed that the recently described group of hypermutant, microsatellite-stable CRCs is likely to be caused by somatic POLE mutations affecting the exonuclease domain.
- Published
- 2013
- Full Text
- View/download PDF
35. Integrative network-based Bayesian analysis of diverse genomics data.
- Author
-
Wang W, Baladandayuthapani V, Holmes CC, and Do KA
- Subjects
- Atlases as Topic, Computer Simulation, Humans, MicroRNAs genetics, Normal Distribution, RNA, Messenger genetics, Software, Survival Analysis, Bayes Theorem, Genomics methods, Glioblastoma genetics, Neoplasms genetics, Systems Integration
- Abstract
Background: In order to better understand cancer as a complex disease with multiple genetic and epigenetic factors, it is vital to model the fundamental biological relationships among these alterations as well as their relationships with important clinical outcomes., Methods: We develop an integrative network-based Bayesian analysis (iNET) approach that allows us to jointly analyze multi-platform high-dimensional genomic data in a computationally efficient manner. The iNET approach is formulated as an objective Bayesian model selection problem for Gaussian graphical models to model joint dependencies among platform-specific features using known biological mechanisms. Using both simulated datasets and a glioblastoma (GBM) study from The Cancer Genome Atlas (TCGA), we illustrate the iNET approach via integrating three data types, microRNA, gene expression (mRNA), and patient survival time., Results: We show that the iNET approach has greater power in identifying cancer-related microRNAs than non-integrative approaches based on realistic simulated datasets. In the TCGA GBM study, we found many mRNA-microRNA pairs and microRNAs that are associated with patient survival time, with some of these associations identified in previous studies., Conclusions: The iNET discovers relationships consistent with the underlying biological mechanisms among these variables, as well as identifying important biomarkers that are potentially relevant to patient survival. In addition, we identified some microRNAs that can potentially affect patient survival which are missed by non-integrative approaches.
- Published
- 2013
- Full Text
- View/download PDF
36. Methods for qPCR gene expression profiling applied to 1440 lymphoblastoid single cells.
- Author
-
Livak KJ, Wills QF, Tipping AJ, Datta K, Mittal R, Goldson AJ, Sexton DW, and Holmes CC
- Subjects
- Cell Line, Data Interpretation, Statistical, Humans, Limit of Detection, Reverse Transcriptase Polymerase Chain Reaction, Wnt Signaling Pathway, Gene Expression Profiling methods, Lymphoid Progenitor Cells metabolism, Real-Time Polymerase Chain Reaction, Single-Cell Analysis
- Abstract
The stochastic nature of generating eukaryotic transcripts challenges conventional methods for obtaining and analyzing single-cell gene expression data. In order to address the inherent noise, detailed methods are described on how to collect data on multiple genes in a large number of single cells using microfluidic arrays. As part of a study exploring the effect of genotype on Wnt pathway activation, data were collected for 96 qPCR assays on 1440 lymphoblastoid cells. The description of methods includes preliminary data processing steps. The methods used in the collection and analysis of single-cell qPCR data are contrasted with those used in conventional qPCR., (Copyright © 2012 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
37. GREVE: Genomic Recurrent Event ViEwer to assist the identification of patterns across individual cancer samples.
- Author
-
Cazier JB, Holmes CC, and Broxholme J
- Subjects
- Chromosome Breakpoints, Humans, Chromosome Aberrations, Genome, Human, Neoplasms genetics, Software
- Abstract
Summary: GREVE has been developed to assist with the identification of recurrent genomic aberrations across cancer samples. The exact characterization of such aberrations remains a challenge despite the availability of increasing amount of data, from SNParray to next-generation sequencing. Furthermore, genomic aberrations in cancer are especially difficult to handle because they are, by nature, unique to the patients. However, their recurrence in specific regions of the genome has been shown to reflect their relevance in the development of tumors. GREVE makes use of previously characterized events to identify such regions and focus any further analysis., Availability: GREVE is available through a web interface and open-source application (http://www.well.ox.ac.uk/GREVE).
- Published
- 2012
- Full Text
- View/download PDF
38. Reprioritizing genetic associations in hit regions using LASSO-based resample model averaging.
- Author
-
Valdar W, Sabourin J, Nobel A, and Holmes CC
- Subjects
- Algorithms, Bayes Theorem, Calibration, Case-Control Studies, Chromosome Mapping, Computer Simulation, Genotype, Humans, Models, Genetic, Models, Statistical, Models, Theoretical, Molecular Epidemiology methods, ROC Curve, Regression Analysis, Genome-Wide Association Study methods
- Abstract
Significance testing one SNP at a time has proven useful for identifying genomic regions that harbor variants affecting human disease. But after an initial genome scan has identified a "hit region" of association, single-locus approaches can falter. Local linkage disequilibrium (LD) can make both the number of underlying true signals and their identities ambiguous. Simultaneous modeling of multiple loci should help. However, it is typically applied ad hoc: conditioning on the top SNPs, with limited exploration of the model space and no assessment of how sensitive model choice was to sampling variability. Formal alternatives exist but are seldom used. Bayesian variable selection is coherent but requires specifying a full joint model, including priors on parameters and the model space. Penalized regression methods (e.g., LASSO) appear promising but require calibration, and, once calibrated, lead to a choice of SNPs that can be misleadingly decisive. We present a general method for characterizing uncertainty in model choice that is tailored to reprioritizing SNPs within a hit region under strong LD. Our method, LASSO local automatic regularization resample model averaging (LLARRMA), combines LASSO shrinkage with resample model averaging and multiple imputation, estimating for each SNP the probability that it would be included in a multi-SNP model in alternative realizations of the data. We apply LLARRMA to simulations based on case-control genome-wide association studies data, and find that when there are several causal loci and strong LD, LLARRMA identifies a set of candidates that is enriched for true signals relative to single locus analysis and to the recently proposed method of Stability Selection., (© 2012 Wiley Periodicals, Inc.)
- Published
- 2012
- Full Text
- View/download PDF
39. Quantification of subclonal distributions of recurrent genomic aberrations in paired pre-treatment and relapse samples from patients with B-cell chronic lymphocytic leukemia.
- Author
-
Knight SJ, Yau C, Clifford R, Timbs AT, Sadighi Akha E, Dréau HM, Burns A, Ciria C, Oscier DG, Pettitt AR, Dutton S, Holmes CC, Taylor J, Cazier JB, and Schuh A
- Subjects
- Adult, Aged, Aged, 80 and over, Female, Gene Dosage, Genome, Human, Genomics, Humans, Leukemia, Lymphocytic, Chronic, B-Cell therapy, Loss of Heterozygosity, Male, Middle Aged, Neoplasm Recurrence, Local therapy, Oligonucleotide Array Sequence Analysis, Prognosis, Biomarkers, Tumor genetics, Chromosome Aberrations, Clone Cells pathology, Leukemia, Lymphocytic, Chronic, B-Cell genetics, Neoplasm Recurrence, Local genetics, Polymorphism, Single Nucleotide genetics
- Abstract
Genome-wide array approaches and sequencing analyses are powerful tools for identifying genetic aberrations in cancers, including leukemias and lymphomas. However, the clinical and biological significance of such aberrations and their subclonal distribution are poorly understood. Here, we present the first genome-wide array based study of pre-treatment and relapse samples from patients with B-cell chronic lymphocytic leukemia (B-CLL) that uses the computational statistical tool OncoSNP. We show that quantification of the proportion of copy number alterations (CNAs) and copy neutral loss of heterozygosity regions (cnLOHs) in each sample is feasible. Furthermore, we (i) reveal complex changes in the subclonal architecture of paired samples at relapse compared with pre-treatment, (ii) provide evidence supporting an association between increased genomic complexity and poor clinical outcome (iii) report previously undefined, recurrent CNA/cnLOH regions that expand or newly occur at relapse and therefore might harbor candidate driver genes of relapse and/or chemotherapy resistance. Our findings are likely to impact on future therapeutic strategies aimed towards selecting effective and individually tailored targeted therapies.
- Published
- 2012
- Full Text
- View/download PDF
40. A novel test for gene-ancestry interactions in genome-wide association data.
- Author
-
Davies JL, Cazier JB, Dunlop MG, Houlston RS, Tomlinson IP, and Holmes CC
- Subjects
- Colorectal Neoplasms genetics, Cytochrome b Group genetics, Humans, Oxidoreductases genetics, Phenotype, Genetic Predisposition to Disease, Genome-Wide Association Study methods, Genotype, White People genetics
- Abstract
Genome-wide association study (GWAS) data on a disease are increasingly available from multiple related populations. In this scenario, meta-analyses can improve power to detect homogeneous genetic associations, but if there exist ancestry-specific effects, via interactions on genetic background or with a causal effect that co-varies with genetic background, then these will typically be obscured. To address this issue, we have developed a robust statistical method for detecting susceptibility gene-ancestry interactions in multi-cohort GWAS based on closely-related populations. We use the leading principal components of the empirical genotype matrix to cluster individuals into "ancestry groups" and then look for evidence of heterogeneous genetic associations with disease or other trait across these clusters. Robustness is improved when there are multiple cohorts, as the signal from true gene-ancestry interactions can then be distinguished from gene-collection artefacts by comparing the observed interaction effect sizes in collection groups relative to ancestry groups. When applied to colorectal cancer, we identified a missense polymorphism in iron-absorption gene CYBRD1 that associated with disease in individuals of English, but not Scottish, ancestry. The association replicated in two additional, independently-collected data sets. Our method can be used to detect associations between genetic variants and disease that have been obscured by population genetic heterogeneity. It can be readily extended to the identification of genetic interactions on other covariates such as measured environmental exposures. We envisage our methodology being of particular interest to researchers with existing GWAS data, as ancestry groups can be easily defined and thus tested for interactions.
- Published
- 2012
- Full Text
- View/download PDF
41. Coexpression network analysis in abdominal and gluteal adipose tissue reveals regulatory genetic loci for metabolic syndrome and related phenotypes.
- Author
-
Min JL, Nicholson G, Halgrimsdottir I, Almstrup K, Petri A, Barrett A, Travers M, Rayner NW, Mägi R, Pettersson FH, Broxholme J, Neville MJ, Wills QF, Cheeseman J, Allen M, Holmes CC, Spector TD, Fleckner J, McCarthy MI, Karpe F, Lindgren CM, and Zondervan KT
- Subjects
- Body Mass Index, Chemokines genetics, Female, Genetic Loci, Genome-Wide Association Study, HLA-DRB1 Chains genetics, Humans, Intercellular Signaling Peptides and Proteins, Metabolic Syndrome pathology, Organ Specificity, Phenotype, Quantitative Trait Loci, Adipose Tissue metabolism, Gene Expression Profiling, Gene Regulatory Networks, Metabolic Syndrome genetics
- Abstract
Metabolic Syndrome (MetS) is highly prevalent and has considerable public health impact, but its underlying genetic factors remain elusive. To identify gene networks involved in MetS, we conducted whole-genome expression and genotype profiling on abdominal (ABD) and gluteal (GLU) adipose tissue, and whole blood (WB), from 29 MetS cases and 44 controls. Co-expression network analysis for each tissue independently identified nine, six, and zero MetS-associated modules of coexpressed genes in ABD, GLU, and WB, respectively. Of 8,992 probesets expressed in ABD or GLU, 685 (7.6%) were expressed in ABD and 51 (0.6%) in GLU only. Differential eigengene network analysis of 8,256 shared probesets detected 22 shared modules with high preservation across adipose depots (D(ABD-GLU) = 0.89), seven of which were associated with MetS (FDR P<0.01). The strongest associated module, significantly enriched for immune response-related processes, contained 94/620 (15%) genes with inter-depot differences. In an independent cohort of 145/141 twins with ABD and WB longitudinal expression data, median variability in ABD due to familiality was greater for MetS-associated versus un-associated modules (ABD: 0.48 versus 0.18, P = 0.08; GLU: 0.54 versus 0.20, P = 7.8×10(-4)). Cis-eQTL analysis of probesets associated with MetS (FDR P<0.01) and/or inter-depot differences (FDR P<0.01) provided evidence for 32 eQTLs. Corresponding eSNPs were tested for association with MetS-related phenotypes in two GWAS of >100,000 individuals; rs10282458, affecting expression of RARRES2 (encoding chemerin), was associated with body mass index (BMI) (P = 6.0×10(-4)); and rs2395185, affecting inter-depot differences of HLA-DRB1 expression, was associated with high-density lipoprotein (P = 8.7×10(-4)) and BMI-adjusted waist-to-hip ratio (P = 2.4×10(-4)). Since many genes and their interactions influence complex traits such as MetS, integrated analysis of genotypes and coexpression networks across multiple tissues relevant to clinical traits is an efficient strategy to identify novel associations., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2012
- Full Text
- View/download PDF
42. Accounting for control mislabeling in case-control biomarker studies.
- Author
-
Rantalainen M and Holmes CC
- Subjects
- Algorithms, Biomarkers analysis, Computer Simulation, Humans, Logistic Models, Meta-Analysis as Topic, ROC Curve, Reproducibility of Results, Risk Factors, Uncertainty, Bias, Biomarkers chemistry, Case-Control Studies
- Abstract
In biomarker discovery studies, uncertainty associated with case and control labels is often overlooked. By omitting to take into account label uncertainty, model parameters and the predictive risk can become biased, sometimes severely. The most common situation is when the control set contains an unknown number of undiagnosed, or future, cases. This has a marked impact in situations where the model needs to be well-calibrated, e.g., when the prediction performance of a biomarker panel is evaluated. Failing to account for class label uncertainty may lead to underestimation of classification performance and bias in parameter estimates. This can further impact on meta-analysis for combining evidence from multiple studies. Using a simulation study, we outline how conventional statistical models can be modified to address class label uncertainty leading to well-calibrated prediction performance estimates and reduced bias in meta-analysis. We focus on the problem of mislabeled control subjects in case-control studies, i.e., when some of the control subjects are undiagnosed cases, although the procedures we report are generic. The uncertainty in control status is a particular situation common in biomarker discovery studies in the context of genomic and molecular epidemiology, where control subjects are commonly sampled from the general population with an established expected disease incidence rate.
- Published
- 2011
- Full Text
- View/download PDF
43. Variance decomposition of protein profiles from antibody arrays using a longitudinal twin model.
- Author
-
Kato BS, Nicholson G, Neiman M, Rantalainen M, Holmes CC, Barrett A, Uhlén M, Nilsson P, Spector TD, and Schwenk JM
- Abstract
Background: The advent of affinity-based proteomics technologies for global protein profiling provides the prospect of finding new molecular biomarkers for common, multifactorial disorders. The molecular phenotypes obtained from studies on such platforms are driven by multiple sources, including genetic, environmental, and experimental components. In characterizing the contribution of different sources of variation to the measured phenotypes, the aim is to facilitate the design and interpretation of future biomedical studies employing exploratory and multiplexed technologies. Thus, biometrical genetic modelling of twin or other family data can be used to decompose the variation underlying a phenotype into biological and experimental components., Results: Using antibody suspension bead arrays and antibodies from the Human Protein Atlas, we study unfractionated serum from a longitudinal study on 154 twins. In this study, we provide a detailed description of how the variation in a molecular phenotype in terms of protein profile can be decomposed into familial i.e. genetic and common environmental; individual environmental, short-term biological and experimental components. The results show that across 69 antibodies analyzed in the study, the median proportion of the total variation explained by familial sources is 12% (IQR 1-22%), and the median proportion of the total variation attributable to experimental sources is 63% (IQR 53-72%)., Conclusion: The variability analysis of antibody arrays highlights the importance to consider variability components and their relative contributions when designing and evaluating studies for biomarker discoveries with exploratory, high-throughput and multiplexed methods.
- Published
- 2011
- Full Text
- View/download PDF
44. A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection.
- Author
-
Nicholson G, Rantalainen M, Li JV, Maher AD, Malmodin D, Ahmadi KR, Faber JH, Barrett A, Min JL, Rayner NW, Toft H, Krestyaninova M, Viksna J, Neogi SG, Dumas ME, Sarkans U, Donnelly P, Illig T, Adamski J, Suhre K, Allen M, Zondervan KT, Spector TD, Nicholson JK, Lindon JC, Baunsgaard D, Holmes E, McCarthy MI, and Holmes CC
- Subjects
- Acetyltransferases genetics, Acetyltransferases metabolism, Dimethylamines blood, Dimethylamines metabolism, Female, Haplotypes, Humans, Isobutyrates metabolism, Isobutyrates urine, Magnetic Resonance Spectroscopy, Methylamines metabolism, Methylamines urine, Polymorphism, Single Nucleotide, Genome-Wide Association Study, Metabolic Networks and Pathways genetics, Metabolome genetics, Quantitative Trait Loci genetics, Selection, Genetic
- Abstract
We have performed a metabolite quantitative trait locus (mQTL) study of the (1)H nuclear magnetic resonance spectroscopy ((1)H NMR) metabolome in humans, building on recent targeted knowledge of genetic drivers of metabolic regulation. Urine and plasma samples were collected from two cohorts of individuals of European descent, with one cohort comprised of female twins donating samples longitudinally. Sample metabolite concentrations were quantified by (1)H NMR and tested for association with genome-wide single-nucleotide polymorphisms (SNPs). Four metabolites' concentrations exhibited significant, replicable association with SNP variation (8.6×10(-11)
- Published
- 2011
- Full Text
- View/download PDF
45. Human metabolic profiles are stably controlled by genetic and environmental variation.
- Author
-
Nicholson G, Rantalainen M, Maher AD, Li JV, Malmodin D, Ahmadi KR, Faber JH, Hallgrímsdóttir IB, Barrett A, Toft H, Krestyaninova M, Viksna J, Neogi SG, Dumas ME, Sarkans U, The Molpage Consortium, Silverman BW, Donnelly P, Nicholson JK, Allen M, Zondervan KT, Lindon JC, Spector TD, McCarthy MI, Holmes E, Baunsgaard D, and Holmes CC
- Subjects
- Aged, Algorithms, Databases, Genetic, Female, Genetic Variation, Humans, Middle Aged, Models, Statistical, Research Design, Sample Size, Twins, Dizygotic genetics, Twins, Monozygotic genetics, Biomarkers blood, Biomarkers urine, Gene-Environment Interaction, Metabolome genetics, Nuclear Magnetic Resonance, Biomolecular methods, Systems Biology methods, White People genetics
- Abstract
¹H Nuclear Magnetic Resonance spectroscopy (¹H NMR) is increasingly used to measure metabolite concentrations in sets of biological samples for top-down systems biology and molecular epidemiology. For such purposes, knowledge of the sources of human variation in metabolite concentrations is valuable, but currently sparse. We conducted and analysed a study to create such a resource. In our unique design, identical and non-identical twin pairs donated plasma and urine samples longitudinally. We acquired ¹H NMR spectra on the samples, and statistically decomposed variation in metabolite concentration into familial (genetic and common-environmental), individual-environmental, and longitudinally unstable components. We estimate that stable variation, comprising familial and individual-environmental factors, accounts on average for 60% (plasma) and 47% (urine) of biological variation in ¹H NMR-detectable metabolite concentrations. Clinically predictive metabolic variation is likely nested within this stable component, so our results have implications for the effective design of biomarker-discovery studies. We provide a power-calculation method which reveals that sample sizes of a few thousand should offer sufficient statistical precision to detect ¹H NMR-based biomarkers quantifying predisposition to disease.
- Published
- 2011
- Full Text
- View/download PDF
46. MicroRNA expression in abdominal and gluteal adipose tissue is associated with mRNA expression levels and partly genetically driven.
- Author
-
Rantalainen M, Herrera BM, Nicholson G, Bowden R, Wills QF, Min JL, Neville MJ, Barrett A, Allen M, Rayner NW, Fleckner J, McCarthy MI, Zondervan KT, Karpe F, Holmes CC, and Lindgren CM
- Subjects
- Adult, Case-Control Studies, Female, Follow-Up Studies, Gene Expression Profiling, Genome-Wide Association Study, Genotype, Humans, Male, Metabolic Syndrome genetics, Oligonucleotide Array Sequence Analysis, Polymorphism, Single Nucleotide genetics, Abdominal Fat physiology, Biomarkers metabolism, Buttocks physiology, MicroRNAs genetics, Quantitative Trait Loci, RNA, Messenger genetics
- Abstract
To understand how miRNAs contribute to the molecular phenotype of adipose tissues and related traits, we performed global miRNA expression profiling in subcutaneous abdominal and gluteal adipose tissue of 70 human subjects and characterised which miRNAs were differentially expressed between these tissues. We found that 12% of the miRNAs were significantly differentially expressed between abdominal and gluteal adipose tissue (FDR adjusted p<0.05) in the primary study, of which 59 replicated in a follow-up study of 40 additional subjects. Further, 14 miRNAs were found to be associated with metabolic syndrome case-control status in abdominal tissue and three of these replicated (primary study: FDR adjusted p<0.05, replication: p<0.05 and directionally consistent effect). Genome-wide genotyping was performed in the 70 subjects to enable miRNA expression quantitative trait loci (eQTL) analysis. Candidate miRNA eQTLs were followed-up in the additional 40 subjects and six significant, independent cis-located miRNA eQTLs (primary study: p<0.001; replication: p<0.05 and directionally consistent effect) were identified. Finally, global mRNA expression profiling was performed in both tissues to enable association analysis between miRNA and target mRNA expression levels. We find 22% miRNAs in abdominal and 9% miRNAs in gluteal adipose tissue with expression levels significantly associated with the expression of corresponding target mRNAs (FDR adjusted p<0.05). Taken together, our results indicate a clear difference in the miRNA molecular phenotypic profile of abdominal and gluteal adipose tissue, that the expressions of some miRNAs are influenced by cis-located genetic variants and that miRNAs are associated with expression levels of their predicted mRNA targets.
- Published
- 2011
- Full Text
- View/download PDF
47. On the utility of graphics cards to perform massively parallel simulation of advanced Monte Carlo methods.
- Author
-
Lee A, Yau C, Giles MB, Doucet A, and Holmes CC
- Abstract
We present a case-study on the utility of graphics cards to perform massively parallel simulation of advanced Monte Carlo methods. Graphics cards, containing multiple Graphics Processing Units (GPUs), are self-contained parallel computational devices that can be housed in conventional desktop and laptop computers and can be thought of as prototypes of the next generation of many-core processors. For certain classes of population-based Monte Carlo algorithms they offer massively parallel simulation, with the added advantage over conventional distributed multi-core processors that they are cheap, easily accessible, easy to maintain, easy to code, dedicated local devices with low power consumption. On a canonical set of stochastic simulation examples including population-based Markov chain Monte Carlo methods and Sequential Monte Carlo methods, we nd speedups from 35 to 500 fold over conventional single-threaded computer code. Our findings suggest that GPUs have the potential to facilitate the growth of statistical modelling into complex data rich domains through the availability of cheap and accessible many-core computation. We believe the speedup we observe should motivate wider use of parallelizable simulation methods and greater methodological attention to their design.
- Published
- 2010
- Full Text
- View/download PDF
48. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution.
- Author
-
Heid IM, Jackson AU, Randall JC, Winkler TW, Qi L, Steinthorsdottir V, Thorleifsson G, Zillikens MC, Speliotes EK, Mägi R, Workalemahu T, White CC, Bouatia-Naji N, Harris TB, Berndt SI, Ingelsson E, Willer CJ, Weedon MN, Luan J, Vedantam S, Esko T, Kilpeläinen TO, Kutalik Z, Li S, Monda KL, Dixon AL, Holmes CC, Kaplan LM, Liang L, Min JL, Moffatt MF, Molony C, Nicholson G, Schadt EE, Zondervan KT, Feitosa MF, Ferreira T, Lango Allen H, Weyant RJ, Wheeler E, Wood AR, Estrada K, Goddard ME, Lettre G, Mangino M, Nyholt DR, Purcell S, Smith AV, Visscher PM, Yang J, McCarroll SA, Nemesh J, Voight BF, Absher D, Amin N, Aspelund T, Coin L, Glazer NL, Hayward C, Heard-Costa NL, Hottenga JJ, Johansson A, Johnson T, Kaakinen M, Kapur K, Ketkar S, Knowles JW, Kraft P, Kraja AT, Lamina C, Leitzmann MF, McKnight B, Morris AP, Ong KK, Perry JR, Peters MJ, Polasek O, Prokopenko I, Rayner NW, Ripatti S, Rivadeneira F, Robertson NR, Sanna S, Sovio U, Surakka I, Teumer A, van Wingerden S, Vitart V, Zhao JH, Cavalcanti-Proença C, Chines PS, Fisher E, Kulzer JR, Lecoeur C, Narisu N, Sandholt C, Scott LJ, Silander K, Stark K, Tammesoo ML, Teslovich TM, Timpson NJ, Watanabe RM, Welch R, Chasman DI, Cooper MN, Jansson JO, Kettunen J, Lawrence RW, Pellikka N, Perola M, Vandenput L, Alavere H, Almgren P, Atwood LD, Bennett AJ, Biffar R, Bonnycastle LL, Bornstein SR, Buchanan TA, Campbell H, Day IN, Dei M, Dörr M, Elliott P, Erdos MR, Eriksson JG, Freimer NB, Fu M, Gaget S, Geus EJ, Gjesing AP, Grallert H, Grässler J, Groves CJ, Guiducci C, Hartikainen AL, Hassanali N, Havulinna AS, Herzig KH, Hicks AA, Hui J, Igl W, Jousilahti P, Jula A, Kajantie E, Kinnunen L, Kolcic I, Koskinen S, Kovacs P, Kroemer HK, Krzelj V, Kuusisto J, Kvaloy K, Laitinen J, Lantieri O, Lathrop GM, Lokki ML, Luben RN, Ludwig B, McArdle WL, McCarthy A, Morken MA, Nelis M, Neville MJ, Paré G, Parker AN, Peden JF, Pichler I, Pietiläinen KH, Platou CG, Pouta A, Ridderstråle M, Samani NJ, Saramies J, Sinisalo J, Smit JH, Strawbridge RJ, Stringham HM, Swift AJ, Teder-Laving M, Thomson B, Usala G, van Meurs JB, van Ommen GJ, Vatin V, Volpato CB, Wallaschofski H, Walters GB, Widen E, Wild SH, Willemsen G, Witte DR, Zgaga L, Zitting P, Beilby JP, James AL, Kähönen M, Lehtimäki T, Nieminen MS, Ohlsson C, Palmer LJ, Raitakari O, Ridker PM, Stumvoll M, Tönjes A, Viikari J, Balkau B, Ben-Shlomo Y, Bergman RN, Boeing H, Smith GD, Ebrahim S, Froguel P, Hansen T, Hengstenberg C, Hveem K, Isomaa B, Jørgensen T, Karpe F, Khaw KT, Laakso M, Lawlor DA, Marre M, Meitinger T, Metspalu A, Midthjell K, Pedersen O, Salomaa V, Schwarz PE, Tuomi T, Tuomilehto J, Valle TT, Wareham NJ, Arnold AM, Beckmann JS, Bergmann S, Boerwinkle E, Boomsma DI, Caulfield MJ, Collins FS, Eiriksdottir G, Gudnason V, Gyllensten U, Hamsten A, Hattersley AT, Hofman A, Hu FB, Illig T, Iribarren C, Jarvelin MR, Kao WH, Kaprio J, Launer LJ, Munroe PB, Oostra B, Penninx BW, Pramstaller PP, Psaty BM, Quertermous T, Rissanen A, Rudan I, Shuldiner AR, Soranzo N, Spector TD, Syvanen AC, Uda M, Uitterlinden A, Völzke H, Vollenweider P, Wilson JF, Witteman JC, Wright AF, Abecasis GR, Boehnke M, Borecki IB, Deloukas P, Frayling TM, Groop LC, Haritunians T, Hunter DJ, Kaplan RC, North KE, O'Connell JR, Peltonen L, Schlessinger D, Strachan DP, Hirschhorn JN, Assimes TL, Wichmann HE, Thorsteinsdottir U, van Duijn CM, Stefansson K, Cupples LA, Loos RJ, Barroso I, McCarthy MI, Fox CS, Mohlke KL, and Lindgren CM
- Subjects
- Adipose Tissue anatomy & histology, Age Factors, Chromosome Mapping, Female, Genome, Human, Humans, Male, Meta-Analysis as Topic, Sex Characteristics, Genome-Wide Association Study, Polymorphism, Single Nucleotide, Waist-Hip Ratio
- Abstract
Waist-hip ratio (WHR) is a measure of body fat distribution and a predictor of metabolic consequences independent of overall adiposity. WHR is heritable, but few genetic variants influencing this trait have been identified. We conducted a meta-analysis of 32 genome-wide association studies for WHR adjusted for body mass index (comprising up to 77,167 participants), following up 16 loci in an additional 29 studies (comprising up to 113,636 subjects). We identified 13 new loci in or near RSPO3, VEGFA, TBX15-WARS2, NFE2L3, GRB14, DNM3-PIGC, ITPR2-SSPN, LY86, HOXC13, ADAMTS9, ZNRF3-KREMEN1, NISCH-STAB1 and CPEB4 (P = 1.9 × 10⁻⁹ to P = 1.8 × 10⁻⁴⁰) and the known signal at LYPLAL1. Seven of these loci exhibited marked sexual dimorphism, all with a stronger effect on WHR in women than men (P for sex difference = 1.9 × 10⁻³ to P = 1.2 × 10⁻¹³). These findings provide evidence for multiple loci that modulate body fat distribution independent of overall adiposity and reveal strong gene-by-sex interactions.
- Published
- 2010
- Full Text
- View/download PDF
49. A Bayesian approach using covariance of single nucleotide polymorphism data to detect differences in linkage disequilibrium patterns between groups of individuals.
- Author
-
Clark TG, Campino SG, Anastasi E, Auburn S, Teo YY, Small K, Rockett KA, Kwiatkowski DP, and Holmes CC
- Subjects
- Bayes Theorem, Genome, Genome-Wide Association Study, Genotype, Humans, Sample Size, Software, Linkage Disequilibrium, Polymorphism, Single Nucleotide
- Abstract
Motivation: Quantifying differences in linkage disequilibrium (LD) between sub-groups can highlight genetic regions or sites under selection and/or associated with disease, and may have utility in trans-ethnic mapping studies., Results: We present a novel pseudo Bayes factor (PBF) approach that assess differences in covariance of genotype frequencies from single nucleotide polymorphism (SNP) data from a genome-wide study. The magnitude of the PBF reflects the strength of evidence for a difference, while accounting for the sample size and number of SNPs, without the requirement for permutation testing to establish statistical significance. Application of the PBF to HapMap and Gambian malaria SNP data reveals regional LD differences, some known to be under selection., Availability and Implementation: The PBF approach has been implemented in the BALD (Bayesian analysis of LD differences) C++ software, and is available from http://homepages.lshtm.ac.uk/tgclark/downloads.
- Published
- 2010
- Full Text
- View/download PDF
50. Detecting interacting genetic loci with effects on quantitative traits where the nature and order of the interaction are unknown.
- Author
-
Davies JL, Hein J, and Holmes CC
- Subjects
- Algorithms, Alleles, Bayes Theorem, Computer Simulation, Environment, Genetic Markers, Genotype, Humans, Models, Genetic, Odds Ratio, Software, Genetic Loci, Models, Statistical, Quantitative Trait Loci
- Abstract
Standard techniques for single marker quantitative trait mapping perform poorly in detecting complex interacting genetic influences. When a genetic marker interacts with other genetic markers and/or environmental factors to influence a quantitative trait, a sample of individuals will show different effects according to their exposure to other interacting factors. This paper presents a Bayesian mixture model, which effectively models heterogeneous genetic effects apparent at a single marker. We compute approximate Bayes factors which provide an efficient strategy for screening genetic markers (genome-wide) for evidence of a heterogeneous effect on a quantitative trait. We present a simulation study which demonstrates that the approximation is good and provide a real data example which identifies a population-specific genetic effect on gene expression in the HapMap CEU and YRI populations. We advocate the use of the model as a strategy for identifying candidate interacting markers without any knowledge of the nature or order of the interaction. The source of heterogeneity can be modeled as an extension., ((c) 2009 Wiley-Liss, Inc.)
- Published
- 2010
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.