Bioinformatics Tutorial
Files Needed
  • Getting Started
    • Setup
    • Run jobs in a Docker
    • Run jobs in a cluster [Advanced]
  • Part I. Programming Skills
    • 1.Linux
      • 1.1.Basic Command
      • 1.2.Practice Guide
      • 1.3.Linux Bash
    • 2.R
      • 2.1.R Basics
      • 2.2.Plot with R
    • 3.Python
  • PART II. BASIC ANALYSES
    • 1.Blast
    • 2.Conservation Analysis
    • 3.Function Analysis
      • 3.1.GO
      • 3.2.KEGG
      • 3.3.GSEA
    • 4.Clinical Analyses
      • 4.1.Survival Analysis
  • Part III. NGS DATA ANALYSES
    • 1.Mapping
      • 1.1 Genome Browser
      • 1.2 bedtools and samtools
    • 2.RNA-seq
      • 2.1.Expression Matrix
      • 2.2.Differential Expression with Cufflinks
      • 2.3.Differential Expression with DEseq2 and edgeR
    • 3.ChIP-seq
    • 4.Motif
      • 4.1.Sequence Motif
      • 4.2.Structure Motif
    • 5.RNA Network
      • 5.1.Co-expression Network
      • 5.2.miRNA Targets
      • 5.3. CLIP-seq (RNA-Protein Interaction)
    • 6.RNA Regulation - I
      • 6.1.Alternative Splicing
      • 6.2.APA (Alternative Polyadenylation)
      • 6.3.Chimeric RNA
      • 6.4.RNA Editing
      • 6.5.SNV/INDEL
    • 7.RNA Regulation - II
      • 7.1.Translation: Ribo-seq
      • 7.2.RNA Structure
    • 8.cfDNA
      • 8.1.Basic cfDNA-seq Analyses
  • Part IV. MACHINE LEARNING
    • 1.Machine Learning Basics
      • 1.1 Data Pre-processing
      • 1.2 Data Visualization & Dimension Reduction
      • 1.3 Feature Extraction and Selection
      • 1.4 Machine Learning Classifiers/Models
      • 1.5 Performance Evaluation
    • 2.Machine Learning with R
    • 3.Machine Learning with Python
  • Part V. Assignments
    • 1.Precision Medicine - exSEEK
      • Help
      • Archive: Version 2018
        • 1.1.Data Introduction
        • 1.2.Requirement
        • 1.3.Helps
    • 2.RNA Regulation - RiboShape
      • 2.0.Programming Tools
      • 2.1.RNA-seq Analysis
      • 2.2.Ribo-seq Analysis
      • 2.3.SHAPE Data Analysis
      • 2.4.Integration
    • 3.RNA Regulation - dsRNA
    • 4.Single Cell Data Analysis
      • Help
  • 5.Model Programming
  • Appendix
    • Appendix I. Keep Learning
    • Appendix II. Databases & Servers
    • Appendix III. How to Backup
    • Appendix IV. Teaching Materials
    • Appendix V. Software and Tools
    • Appendix VI. Genome Annotations
Powered by GitBook
On this page
  • 1) Features from conventional RNA-seq
  • 1a) Introduction
  • 1b) Pipelines
  • 1c) Literatures
  • 2) Files Needed
  • Method 1: Use Docker
  • Method 2: Directly Download

Was this helpful?

Edit on GitHub
  1. Part III. NGS DATA ANALYSES

6.RNA Regulation - I

Previous5.3. CLIP-seq (RNA-Protein Interaction)Next6.1.Alternative Splicing

Last updated 2 years ago

Was this helpful?

1) Features from conventional RNA-seq

1a) Introduction

在本节中,我们将介绍一系列从二代测序数据出发分析RNA调控的生物信息学方法。基因表达的调控指的是细胞通过改变不同蛋白的表达水平来响应外界的变化。这里的RNA调控指的是在RNA水平上的基因调控。利用二代测序来对RNA调控进行分析,有些利用常规的RNA-seq数据就可以完成,例如:

  • 在不同组织中或不同条件下,真核生物的转录本普遍存在可变剪接(alternative splicing),这就可能导致蛋白产物也相应的发生变化。除此之外,还存在一些跨转录本的剪接事件,导致不同基因的转录本被嵌合到一条RNA中,产生嵌合RNA(chimeric RNA);以及反向剪接(back splicing)事件,使得一部分转录本首尾相连形成环状(circular RNA)等。这些剪接产物有些会被表达出来,直接导致蛋白的改变;也有一些会发挥调控作用。常规RNA-seq数据就可以用来做以上这些剪接事件的分析。

  • 真核生物mRNA的polyA加尾的位置也会有可变性,这一般被称为alternative poly adenylation(APA)。这样产生的转录本3' UTR就会存在序列差异,从而可能导致mRNA翻译效率,稳定性,定位等发生变化,最终调控蛋白的表达。常规RNA-seq数据可以用来做APA的分析,也有专门针对APA分析设计的测序方法,如PAT-seq等。

1b) Pipelines

Multiple features can be derived from conventional RNA-seq data:

  1. (Expression Level)

  2. Alternative Splicing (AS)

  3. Alternative Polyadenylation (APA)

  4. Chimeric RNA

  5. RNA editing

  6. Mutation (SNV/INDEL of DNA)

  7. Alternative Promoter

  8. Allele-specific expression (ASE)

  9. TE-RNA (Transposable Elements, i.e., Repeats)

  10. mbRNA (microbial RNA)

  11. ...

  • Basic methods of RNA feature calculation for features 2-6 will be introduced in 6.1-6.5 of this book.

  • Advanced analysis pipelines for more RNA features are introduced in Github by Lulab members:

1c) Literatures

2) Files Needed

Method 1: Use Docker

(1) 6.1 Alternative Splicing: bioinfo_PartI-PartII-PartIII1-3.tar.gz

#与PART I + II类似
docker load -i ~/Desktop/bioinfo_PartI-PartII-PartIII1-3.tar.gz  #load the image
docker run --name=bioinfo_tsinghua -dt  -h bioinfo_docker --restart unless-stopped -v ~/Desktop/bioinfo_tsinghua_share:/home/test/share xfliu1995/bioinfo_tsinghua:2

docker exec -it rnaeditor bash

cd /home/test/alter-spl/

(2) 6.2 APA, 6.8 Ribo-seq, 6.9 Structure-seq:bioinfo_tsinghua_6.2_apa_6.3_ribo_6.4_structure.tar.gz

docker load -i ~/Desktop/bioinfo_tsinghua_6.2_apa_6.3_ribo_6.4_structure.tar.gz

#注意:下面的文件挂载目录为~/Desktop/bioinfo_tsinghua_share, 自己运行时记得改为自己新建的一个目录名称。
docker run --name=rnaregulation -dt -h bioinfo_docker --restart unless-stopped -v ~/Desktop/bioinfo_tsinghua_share:/home/test/share gangxu/bioinfo_tsinghua_6.2_apa_6.3_ribo_6.4_structure:latest

docker exec -u root -it rnaregulation bash


# APA
cd /home/test/rna_regulation/apa

# Ribo-seq
cd /home/test/rna_regulation/ribo-wave

# Structure-seq
cd /home/test/rna_regulation/structure_seq

(3) 6.3.Chimeric RNA Detection: bioinfo_chimeric.tar.gz

docker load -i ~/Desktop/bioinfo_chimeric.tar.gz
mv ~/Downloads/ref_genome.fa.star.idx ~/Downloads/ctat_genome_lib_build_X_docker
docker run -dt -v ~/Downloads/ctat_genome_lib_build_X_docker:/data --name=bioinfo_starfusion gangxu/starfusion:latest

(4) 6.4 RNA Editing: bioinfo_rnaeditor.1.8.tar.gz

docker load -i ~/Desktop/bioinfo_rnaeditor.tar.gz

#注意:下面的文件挂载目录为/Users/xugang/Downloads/data2, 自己运行时记得改为自己新建的一个目录名称。
docker run --name=rnaeditor -dt -h bioinfo_docker --restart unless-stopped -v /Users/xugang/Downloads/data2:/data2 gangxu/rnaeditor:1.8

docker exec -it rnaeditor bash

cd /home/test

(5) 6.6.SNV/INDEL Detection: bioinfo_snv.tar.gz

docker load -i ~/Downloads/bioinfo_snv.tar.gz
docker run -dt --name=snv -v ~/Downloads/data:/data gangxu/snv:2.0
docker exec -it snv bash

Method 2: Directly Download

by YF Jin

by YH Tao

docker images的下载链接如所示:

需要下载原始数据和基因组文件,请把 ref_genome.fa.star.idx 移到 ctat_genome_lib_build_X_docker文件中。下载解压后需要挂载。

如果不使用docker,也可以直接从 中的Files/ 路径下的相应文件夹中下载所需要的文件。

Abundance
RNA analyses
RNA/Multi-Omics analyses
RNA Regulation - Lu Lab Docs
cfRNA-seq Analysis - Lu Lab Docs
ctat_genome_lib_build_X_docker.zip,ref_genome.fa.star.idx.zip
Files needed
附表