将FASTA读入数据帧并提取FASTA文件的子序列 [英] Read FASTA into a dataframe and extract subsequences of FASTA file

查看：156 发布时间：2020/9/21 3:09:02 r subset bioinformatics fasta

本文介绍了将FASTA读入数据帧并提取FASTA文件的子序列的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个小的 fasta DNA序列文件，看起来像这样:

I have a small fasta file of DNA sequences which looks like this:

>NM_000016 700 200 234
ACATATTGGAGGCCGAAACAATGAGGCGTGATCAACTCAGTATATCAC

>NM_000775 700 124 236
CTAACCTCTCCCAGTGTGGAACCTCTATCTCATGAGAAAGCTGGGATGAG

>NM_003820 700 111 222
ATTTCCTCCTGCTGCCCGGGAGGTAACACCCTGGACCCCTGGAGTCTGCA

问题:

1)如何将Fasta文件作为数据帧读入R，其中每一行是一个序列记录，第一列是refseqID，第二列是序列.

1) How can I read this fasta file into R as a dataframe where each row is a sequence record, the 1st column is the refseqID and the 2nd column is the sequence.

2)如何在(开始，结束)位置提取子序列?

2) How to extract subsequence at (start, end) location?

NM_000016 1  3 #"ACA"
NM_000775 2  6 #"TAACC"
NM_003820 3  5 #"TTC"

将FASTA读入数据帧并提取FASTA文件的子序列 [英] Read FASTA into a dataframe and extract subsequences of FASTA file

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

将FASTA读入数据帧并提取FASTA文件的子序列 [英] Read FASTA into a dataframe and extract subsequences of FASTA file

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭