Installing and using HUMAnN 3: working around MetaPhlAn 3 database download failures

tech 2022-09-18


To install HUMAnN 3, follow the officially recommended procedure: http://huttenhower.sph.harvard.edu/humann3

    conda create --name humann3 python=3.7
    conda activate humann3

The next step matters, and so does the order of the commands:

    conda config --add channels defaults
    conda config --add channels bioconda
    conda config --add channels conda-forge
    conda config --add channels biobakery

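    conda install humann -c biobakery

Do not skip the channel setup above, or you may end up with mismatched versions, MetaPhlAn in particular. A plain conda install is likely to give you MetaPhlAn 3.0, while the software has since moved past 3.0.2. The difference is significant, especially in how the database is configured. The database setup described below does not apply to MetaPhlAn versions older than 3.0.2.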
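One way to confirm that the environment picked up a new enough MetaPhlAn is a small version check. The sketch below assumes that `sort -V` (version sort, as in GNU coreutils) is available and that `metaphlan --version` prints a line containing a dotted version number. `version_ge` and `MIN_VERSION` are names introduced here for illustration; they are not part of HUMAnN or MetaPhlAn.

```shell
#!/usr/bin/env bash

# version_ge A B: succeeds when version A is greater than or equal to version B.
# Relies on `sort -V` putting the smaller version first.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

MIN_VERSION="3.0.2"

if command -v metaphlan >/dev/null 2>&1; then
    # Assumption: the first dotted number in the output is the version.
    installed="$(metaphlan --version 2>&1 | grep -oE '[0-9]+(\.[0-9]+)+' | head -n 1)"
    if [ -n "$installed" ] && version_ge "$installed" "$MIN_VERSION"; then
        echo "MetaPhlAn $installed is recent enough"
    else
        echo "MetaPhlAn ${installed:-unknown} is older than $MIN_VERSION; recheck the channel order"
    fi
else
    echo "metaphlan not found; is the humann3 environment active?"
fi
```

If the check reports an older version, recreating the environment with the channel order given above is probably the simplest fix.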

Once installation finishes, run humann_test; it should pass without problems.

Next, download the core databases.

Nucleotide database:

    humann_databases --download chocophlan full /path/to/databases --update-config yes

Protein database:

    humann_databases --download uniref uniref90_diamond /path/to/databases --update-config yes

utility_mapping files:

    humann_databases --download utility_mapping full /path/to/databases --update-config yes
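The three downloads can also be scripted. The following is a minimal sketch that only reuses the `humann_databases` invocations shown above; `DB_DIR`, `DRY_RUN`, and `download_db` are names made up for this example. By default it prints the commands instead of running them, so set `DRY_RUN=0` to actually start the downloads.

```shell
#!/usr/bin/env bash

# Target folder for all three databases (placeholder path from the text above).
DB_DIR="${DB_DIR:-/path/to/databases}"
# 1 = only print the commands, 0 = execute them.
DRY_RUN="${DRY_RUN:-1}"

# download_db NAME BUILD: fetch one HUMAnN database into DB_DIR
# and record its location in the HUMAnN configuration.
download_db() {
    set -- humann_databases --download "$1" "$2" "$DB_DIR" --update-config yes
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Stop at the first failure so a broken download is not silently skipped.
download_db chocophlan full &&
download_db uniref uniref90_diamond &&
download_db utility_mapping full
```

For example, `DB_DIR=$HOME/humann3 DRY_RUN=0 bash download_dbs.sh` would run the real downloads, where `download_dbs.sh` is whatever file name you saved the sketch under.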

If a database has already been downloaded and you need to point HUMAnN at a different location, run:

    humann_config --update $section $name $value

Running humann_config on its own shows what these three parameters refer to:

    HUMAnN Configuration ( Section : Name = Value )
    database_folders : nucleotide = ~/humann3/chocophlan
    database_folders : protein = ~/humann3/uniref
    database_folders : utility_mapping = ~/humann3/utility_mapping
    run_modes : resume = False
    run_modes : verbose = False
    run_modes : bypass_prescreen = False
    run_modes : bypass_nucleotide_index = False
    run_modes : bypass_nucleotide_search = False
    run_modes : bypass_translated_search = False
    run_modes : threads = 1
    alignment_settings : evalue_threshold = 1.0
    alignment_settings : prescreen_threshold = 0.01
    alignment_settings : translated_subject_coverage_threshold = 50.0
    alignment_settings : translated_query_coverage_threshold = 90.0
    alignment_settings : nucleotide_subject_coverage_threshold = 50.0
    alignment_settings : nucleotide_query_coverage_threshold = 90.0
    output_format : output_max_decimals = 10
    output_format : remove_stratified_output = False
    output_format : remove_column_description_output = False

For example, to change the location of the nucleotide database, run:

    humann_config --update database_folders nucleotide $target_directory

Next, configure the MetaPhlAn 3 database. HUMAnN 3 normally does this automatically during a run, but the authors host the database on Dropbox, which cannot be reached from mainland China, so the run fails. Users in China therefore need to set up the MetaPhlAn 3 database ahead of time by running:

    metaphlan --install --bowtie2db $target_directory

A few things to watch out for:

① For the database folder, the default location is recommended: ~/miniconda3/envs/humann3/lib/python3.7/site-packages/metaphlan/metaphlan_database
② The authors provide three download locations for the database, but only Zenodo can be accessed freely from China. Annoyingly, the required file file_list.txt is not on Zenodo. One option is to copy the content below and save it as file_list.txt, one name and URL pair per line:

    bcftools https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AACPUjNhnsIIJn16-ww6-MWOa/bcftools?dl=1
    metaphlan2_homebrew_counter.txt https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAD6ImeA91we2nBWBhpfaEOqa/metaphlan2_homebrew_counter.txt?dl=1
    mpa_latest https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAyoJpOgcjop41VIHAGWIVLa/mpa_latest?dl=1
    mpa_v20_m200_marker_info.txt.bz2 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AABKU_RAK5yOzyhV27NpOduDa/mpa_v20_m200_marker_info.txt.bz2?dl=1
    mpa_v20_m200.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AADS8nukx3dSoiR82OHw6dOka/mpa_v20_m200.md5?dl=1
    mpa_v20_m200.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAASBOj-2gAbA53cV1bXBULYa/mpa_v20_m200.tar?dl=1
    mpa_v29_CHOCOPhlAn_201901_marker_info.txt.bz2 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AADhE2Ur7JqirifOdgi4fGQEa/mpa_v29_CHOCOPhlAn_201901_marker_info.txt.bz2?dl=1
    mpa_v29_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AADdxAWsjLPLjy10VICgSAEPa/mpa_v29_CHOCOPhlAn_201901.md5?dl=1
    mpa_v29_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAACeczBU6P9lIBD4ZYtxwKva/mpa_v29_CHOCOPhlAn_201901.tar?dl=1
    mpa_v292_CHOCOPhlAn_201901_marker_info.txt.bz2 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AADoxBNynVopWt2shYYKQ2Mba/mpa_v292_CHOCOPhlAn_201901_marker_info.txt.bz2?dl=1
    mpa_v292_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAApZLRLD0Bkb86bvH-Y3-tUa/mpa_v292_CHOCOPhlAn_201901.md5?dl=1
    mpa_v292_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AABK4Zns2PYUh_R-sLGEx_Bza/mpa_v292_CHOCOPhlAn_201901.tar?dl=1
    mpa_v293_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAsg2PN5Ng6uwHnEemlqo3-a/mpa_v293_CHOCOPhlAn_201901.md5?dl=1
    mpa_v293_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AABFg8C3gMyNpYNTAY5PcSONa/mpa_v293_CHOCOPhlAn_201901.tar?dl=1
    mpa_v294_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAA6KBc-nd8_C4bJOPVjzWW7a/mpa_v294_CHOCOPhlAn_201901.md5?dl=1
    mpa_v294_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAJSiG2qkfNMcYzhvz4Rxsga/mpa_v294_CHOCOPhlAn_201901.tar?dl=1
    mpa_v295_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AACIWLuO_0ixZsxiSZZG-4M1a/mpa_v295_CHOCOPhlAn_201901.md5?dl=1
    mpa_v295_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAA0y5qvPjJdxVw86Ovjinipa/mpa_v295_CHOCOPhlAn_201901.tar?dl=1
    mpa_v296_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAbRMEnsewv_pHFsLv2Be0va/mpa_v296_CHOCOPhlAn_201901.md5?dl=1
    mpa_v296_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AABAD51gd3wr0___2MWO0xD-a/mpa_v296_CHOCOPhlAn_201901.tar?dl=1
    mpa_v296_CHOCOPhlAn_201901_marker_info.txt.bz2 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAv_ShZiz7pNTaT_YONJTF7a/mpa_v296_CHOCOPhlAn_201901_marker_info.txt.bz2?dl=1
    mpa_v30_CHOCOPhlAn_201901_marker_info.txt.bz2 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAlyQITZuUCtBUJxpxhIroIa/mpa_v30_CHOCOPhlAn_201901_marker_info.txt.bz2?dl=1
    mpa_v30_CHOCOPhlAn_201901.tar https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AADlxibskzbPHPoDl6S-FyKka/mpa_v30_CHOCOPhlAn_201901.tar?dl=1
    mpa_v30_CHOCOPhlAn_201901.md5 https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AACTzoUYDqZps8u2JqWCNCODa/mpa_v30_CHOCOPhlAn_201901.md5?dl=1
    SRS019033.fastq https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AACh1NQExDk39RXzZOyTPmQwa/SRS019033.fastq?dl=1
    strainphlan_homebrew_counter.txt https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAZO5uydQ2QQkOSkYu5DxRha/strainphlan_homebrew_counter.txt?dl=1

Then put file_list.txt in the database folder.

③ cd into the database folder and download the three key files from Zenodo:

    wget https://zenodo.org/record/3957592/files/mpa_latest
    wget https://zenodo.org/record/3957592/files/mpa_v30_CHOCOPhlAn_201901.md5
    wget https://zenodo.org/record/3957592/files/mpa_v30_CHOCOPhlAn_201901.tar

Do not extract the archive yourself. These three files are all you need, and the install command takes care of the rest. Then run:

    metaphlan --install --bowtie2db $target_directory
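After roughly 10 to 20 minutes, it reports that the database was installed successfully. If the process is interrupted for any reason, be sure to delete everything in the database folder except the four files above (file_list.txt plus the three Zenodo downloads), then run the command again. Otherwise MetaPhlAn reports the database as already installed even though it cannot actually be recognized, which is even more frustrating.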
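Before re-running the install after an interruption, it can help to check the downloaded tarball and see what else is sitting in the database folder. The sketch below is illustrative only: `md5_matches`, `list_leftovers`, and `DB_FOLDER` are hypothetical names, it assumes `md5sum` (GNU coreutils) is available, and it assumes the first whitespace-separated field of the .md5 file is the checksum. It only lists leftover files and deletes nothing.

```shell
#!/usr/bin/env bash

# md5_matches FILE MD5FILE: succeeds when the MD5 of FILE equals
# the first field on the first line of MD5FILE.
md5_matches() {
    expected="$(awk 'NR == 1 { print $1 }' "$2" 2>/dev/null)"
    actual="$(md5sum "$1" 2>/dev/null | awk '{ print $1 }')"
    [ -n "$expected" ] && [ "$expected" = "$actual" ]
}

# list_leftovers DIR: print every entry in DIR that is not one of
# the four files to keep before re-running the install.
list_leftovers() {
    for entry in "$1"/*; do
        [ -e "$entry" ] || continue
        case "$(basename "$entry")" in
            file_list.txt|mpa_latest|mpa_v30_CHOCOPhlAn_201901.md5|mpa_v30_CHOCOPhlAn_201901.tar) ;;
            *) echo "$entry" ;;
        esac
    done
}

# Hypothetical default; point DB_FOLDER at your own MetaPhlAn database folder.
DB_FOLDER="${DB_FOLDER:-$PWD/metaphlan_database}"

if [ -d "$DB_FOLDER" ]; then
    if md5_matches "$DB_FOLDER/mpa_v30_CHOCOPhlAn_201901.tar" "$DB_FOLDER/mpa_v30_CHOCOPhlAn_201901.md5"; then
        echo "tarball checksum OK"
    else
        echo "tarball checksum mismatch or file missing; consider downloading it again"
    fi
    echo "entries to remove before re-running metaphlan --install:"
    list_leftovers "$DB_FOLDER"
fi
```

Reviewing the printed list by hand before deleting anything keeps the four required files safe.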

For reference, see https://github.com/biobakery/MetaPhlAn/wiki/MetaPhlAn-3.0
