RNA-seq入門RNA-seq実験では何を測定しているのか?実験設計における重要な考慮事項RNA-seqデータの定量化:リードからカウント行列への変換プロセス参照配列を見つけるには本ワークショップではどのような方向性を目指していくのでしょうか?


Figure 1

分子生物学のセントラルドグマの一部を図解したもので、DNAがRNAに転写され、イントロン配列が除去される過程を示しています

Figure 2

RNA-seq実験の主要な実験手順を図解したもの

Figure 3

実験から得られる測定値に影響を与える多様な要因を、処理効果・生物学的要因・技術的要因・誤差要因に分類した図

Figure 4

シーケンサーによって生成されたリードセットと、ゲノム配列およびトランスクリプトーム参照配列の概念図

Figure 5

MAプロットの具体例ヒートマップの具体例


RStudioプロジェクトと実験データ


Figure 1

Your working directory should look like this
Your working directory should look like this

Figure 2

A new .Rproj file should be created in your chosen working directory.
A new .Rproj file should be created in your chosen working directory.

Figure 3

Your R working directory should now be set to where the .Rproj file resides.
Your R working directory should now be set to where the .Rproj file resides.

Figure 4

A file named GSE96870_counts_cerebellum.csv should now reside in the data folder.
A file named GSE96870_counts_cerebellum.csv should now reside in the data folder.

Rに量的データをインポートしてアノテーションを付ける


Figure 1

Schematic showing the composition of a SummarizedExperiment object, with three assay matrices of equal dimension, rowData with feature annotations, colData with sample annotations, and a metadata list.

探索的解析と品質管理


Figure 1

Barplot with total count on the y-axis and sample name on the x-axis, with bars colored by the group annotation. The total count varies between approximately 32 and 43 million.

Figure 2

Scatterplot with library size on the x-axis and size factor on the y-axis, showing a high correlation between the two variables.

Figure 3

Hexagonal heatmap with the mean count on the x-axis and the standard deviation of the count on the y-axis, showing a generally increasing standard deviation with increasing mean. The density of points is highest for low count values.

Figure 4

Hexagonal heatmap with the mean variance-stabilized values on the x-axis and the standard deviation of these on the y-axis. The trend is generally flat, with no clear association between the mean and standard deviation.

Figure 5

Heatmap of Euclidean distances between all pairs of samples, with hierarchical cluster dendrogram for both rows and columns. Samples from day 8 cluster separately from samples from days 0 and 4. Within days 0 and 4, the main clustering is instead by sex.

Figure 6

Scatterplot of samples projected onto the first two principal components, colored by sex and shaped according to the experimental day. The main separation along PC1 is between male and female samples. The main separation along PC2 is between samples from day 8 and samples from days 0 and 4.

Figure 7

Scatterplot of samples projected onto the first two principal components, colored by a hypothetical sample ID annotation and shaped according to a hypothetical experimental day annotation. In the plot, samples with the same sample ID tend to cluster together.

Figure 8

Scatterplot of samples projected onto the first two principal components of the variance-stabilized data, colored by library size. The library sizes are between approximately 32.5 and 42.5 million. There is no strong association between the library sizes and the principal components.

Figure 9

Scatterplot of samples projected onto the first two principal components of the count matrix, colored by library size. The library sizes are between approximately 32.5 and 42.5 million. The first principal component is strongly correlated with the library size.

Differential expression 解析


Figure 1

Scatterplot with the mean of normalized counts on the x-axis and the dispersion on the y-axis. The plot shows black dots corresponding to gene-wise estimates of the dispersion, a red line corresponding to the fitted trend, and blue dots corresponding to the final dispersion estimates. There is a general trend of decreasing dispersion with increasing mean normalized counts.

Figure 2

MAプロット:x軸に正規化後の平均カウント数、y軸に対数倍率変化量を表示。有意に発現変動した遺伝子は青色で表示されています。平均正規化カウント数が少ない遺伝子ほど、対数倍率変化量の範囲が広くなっています。

Figure 3

MAプロット:x軸に正規化後の平均カウント数、y軸に縮小処理された対数倍率変化量を表示。有意に発現変動した遺伝子は青色で表示されています。平均正規化カウント数が少ない遺伝子の対数倍率変化量は、ほとんどがゼロに近い値に縮小されています。

Figure 4

Heatmap showing the vsd-transformed expression levels for the ten most significantly differentially expressed genes over time, in all the samples.

design matricesの詳細な解析


Figure 1


Figure 2


Figure 3


Figure 4


Figure 5


Figure 6


Figure 7


Figure 8


Figure 9


Figure 10


Figure 11


Figure 12


Figure 13


Figure 14


Figure 15


Figure 16


Figure 17


Figure 18


遺伝子セットエンリッチメント解析


Figure 1


Figure 2


Figure 3


Figure 4


Figure 5


Figure 6


Figure 7


Figure 8


Figure 9


Figure 10


Figure 11


Next steps