Óвο¼»ùÒò×éµÄת¼×éÉúÎïÐÅÏ¢·ÖÎöÄ£°å ÏÂÔØ±¾ÎÄ

(1) ½«²âÐòÐòÁÐÕû¶Î±È¶Ôµ½ÍâÏÔ×ÓÉÏ¡£ (2) ½«²âÐòÐòÁзֶαȶԵ½Á½¸öÍâÏÔ×ÓÉÏ¡£

ÎÒÃÇͳ¼ÆÁËʵÑéËù²úÉúµÄ²âÐòÐòÁеĶ¨Î»¸öÊý(Total Mapped Reads)¼°ÆäÕ¼clean readsµÄ°Ù·Ö±È£¬ÆäÖаüÀ¨¶à¸ö¶¨Î»µÄ²âÐòÐòÁиöÊý(Multiple Mapped Reads)¼°ÆäÕ¼×ÜÌ壨clean reads£©µÄ°Ù·Ö±È£¬ÒÔ¼°µ¥¸ö¶¨Î»µÄ²âÐòÐòÁиöÊý(Uniquely Mapped Reads)¼°ÆäÕ¼×ÜÌ壨clean reads£©µÄ°Ù·Ö±È¡£ 3.1 ReadsÓë²Î¿¼»ùÒò×é±È¶ÔÇé¿öͳ¼Æ

±í3.1 ReadsÓë²Î¿¼»ùÒò×é±È¶ÔÇé¿öÒ»ÀÀ±í

Sample name Total reads Total mapped Multiple mapped Uniquely mapped Read-1 Read-2 Reads map to '+' Reads map to '-' Non-splice reads Splice reads Reads mapped in proper pairs HS1 70350410 60529821 (86.04%) 606556 (0.86%) 59923265 (85.18%) 30176973 (42.9%) 29746292 (42.28%) 29930036 (42.54%) 29993229 (42.63%) 42357242 (60.21%) 17566023 (24.97%) 53795182 (76.47%) HS2 70238926 60232484 (85.75%) 633575 (0.9%) 59598909 (84.85%) 29987004 (42.69%) 29611905 (42.16%) 29783311 (42.4%) 29815598 (42.45%) 42528691 (60.55%) 17070218 (24.3%) 54428240 (77.49%) HT1 76161678 63555439 (83.45%) 714678 (0.94%) 62840761 (82.51%) 31592931 (41.48%) 31247830 (41.03%) 31409912 (41.24%) 31430849 (41.27%) 45227757 (59.38%) 17613004 (23.13%) 56181352 (73.77%) HT2 50666084 43461327 (85.78%) 450156 (0.89%) 43011171 (84.89%) 21654629 (42.74%) 21356542 (42.15%) 21476601 (42.39%) 21534570 (42.5%) 31347392 (61.87%) 11663779 (23.02%) 38524314 (76.04%) HW1 46573662 40246848 (86.42%) 389470 (0.84%) 39857378 (85.58%) 20028779 (43%) 19828599 (42.57%) 19923501 (42.78%) 19933877 (42.8%) 28062847 (60.25%) 11794531 (25.32%) 36101400 (77.51%) HW2 40543118 34971284 (86.26%) 335509 (0.83%) 34635775 (85.37%) 17411209 (43.02%) 17224566 (42.35%) 17289330 (42.61%) 17346445 (42.76%) 24725216 (61.1%) 9910559 (24.26%) 31246362 (77.25%) ±È¶Ô½á¹ûͳ¼ÆÏêϸÄÚÈÝÈçÏ£º

(1) Total reads£º²âÐòÐòÁо­¹ý²âÐòÊý¾Ý¹ýÂ˺óµÄÊýÁ¿Í³¼Æ(Clean data)¡£ (2) Total mapped£ºÄܶ¨Î»µ½»ùÒò×éÉϵIJâÐòÐòÁеÄÊýÁ¿µÄͳ¼Æ£»Ò»°ãÇé¿öÏ£¬Èç¹û²»´æÔÚÎÛȾ²¢ÇҲο¼»ùÒò×éÑ¡ÔñºÏÊʵÄÇé¿öÏ£¬Õⲿ·ÖÊý¾ÝµÄ°Ù·Ö±È´óÓÚ 70%¡£

(3) Multiple mapped£ºÔڲο¼ÐòÁÐÉÏÓжà¸ö±È¶ÔλÖõIJâÐòÐòÁеÄÊýÁ¿Í³¼Æ£»Õⲿ·ÖÊý¾ÝµÄ°Ù·Ö±ÈÒ»°ã»áСÓÚ10%¡£

(4) Uniquely mapped£ºÔڲο¼ÐòÁÐÉÏÓÐΨһ±È¶ÔλÖõIJâÐòÐòÁеÄÊýÁ¿Í³

¼Æ¡£

(5) Reads map to '+'£¬Reads map to '-'£º²âÐòÐòÁбȶԵ½»ùÒò×éÉÏÕýÁ´ºÍ¸ºÁ´µÄͳ¼Æ¡£

(6) Splice reads£º(2)ÖУ¬·Ö¶Î±È¶Ôµ½Á½¸öÍâÏÔ×ÓÉϵIJâÐòÐòÁÐ(Ò²³ÆÎªJunction reads)µÄͳ¼Æ£¬Non-splice readsΪÕû¶Î±È¶Ôµ½ÍâÏÔ×ӵĽ«²âÐòÐòÁеÄͳ¼Æ£¬Splice readsµÄ°Ù·Ö±ÈÈ¡¾öÓÚ²âÐòƬ¶ÎµÄ³¤¶È¡£ 3.2 ReadsÔڲο¼»ùÒò×é²»Í¬ÇøÓòµÄ·Ö²¼Çé¿ö

¶ÔTotal mapped readsµÄ±È¶Ôµ½»ùÒò×éÉϵĸ÷¸ö²¿·ÖµÄÇé¿ö½øÐÐͳ¼Æ£¬¶¨Î»ÇøÓò·ÖΪExon(ÍâÏÔ×Ó)¡¢Intron(ÄÚº¬×Ó)ºÍIntergenic(»ùÒò¼ä¸ôÇøÓò)¡£ Õý³£Çé¿öÏ£¬Exon (ÍâÏÔ×Ó)ÇøÓòµÄ²âÐòÐòÁж¨Î»µÄ°Ù·Ö±Èº¬Á¿Ó¦¸Ã×î¸ß£¬¶¨Î»µ½Intron (ÄÚº¬×Ó) ÇøÓòµÄ²âÐòÐòÁпÉÄÜÊÇÓÉÓڷdzÉÊìµÄmRNAµÄÎÛȾ»òÕß»ùÒò×é×¢ÊͲ»ÍêÈ«µ¼Öµģ¬¶ø¶¨Î»µ½Intergenic(»ùÒò¼ä¸ôÇøÓò)µÄ²âÐòÐòÁпÉÄÜÊÇÒòΪ»ùÒò×é×¢ÊͲ»ÍêÈ«ÒÔ¼°±³¾°ÔëÒô¡£

ͼ3.2 ReadsÔڲο¼»ùÒò×é²»Í¬ÇøÓòµÄ·Ö²¼Çé¿ö

3.3 ReadsÔÚȾɫÌåÉϵÄÃܶȷֲ¼Çé¿ö

¶ÔTotal mapped readsµÄ±È¶Ôµ½»ùÒò×éÉϵĸ÷¸öȾɫÌ壨·ÖÕý¸ºÁ´£©µÄÃܶȽøÐÐͳ¼Æ£¬ÈçÏÂͼËùʾ£¬¾ßÌå×÷ͼµÄ·½·¨ÎªÓû¬¶¯´°¿Ú(window size)Ϊ1K£¬¼ÆËã´°¿ÚÄÚ²¿±È¶Ôµ½¼î»ùλÖÃÉϵÄreadsµÄÖÐλÊý£¬²¢×ª»¯³É log2 ¡£Õý³£Çé¿öÏ£¬Õû¸öȾɫÌ峤¶ÈÔ½³¤£¬¸ÃȾɫÌåÄÚ²¿¶¨Î»µÄreads×ÜÊý»áÔ½¶à(Marquez et al.)¡£´Ó¶¨Î»µ½È¾É«ÌåÉϵÄreadsÊýÓëȾɫÌ峤¶ÈµÄ¹ØÏµÍ¼ÖУ¬¿ÉÒÔ¸ü¼ÓÖ±¹Û¿´

³öȾɫÌ峤¶ÈºÍreads×ÜÊýµÄ¹ØÏµ¡£