跳转到主要内容

多种基于Python的生物信息学工具

项目描述

blindschleiche

一组生物信息学/序列工具,用于我的研究,也许对您也有用。

DOI doi:10.5281/zenodo.10049825

安装

pip install blindschleiche
# or for the current main branch:
# pip install git+https://github.com/kdm9/blindschleiche.git

用法

USAGE: blsl <subtool> [options...]


Where <subtool> is one of:

  telogrep:             Search contigs for known telomere repeats
  n50:                  Calculate N50 and total length of a set of contigs
  falen:                Tabulate the lengths of sequences in a FASTA file
  mask2bed:             The inverse of bedtools maskfasta: softmasked fasta -> unmasked fasta + mask.bed
  pansn-rename:         Add, remove, or modify PanSN-style prefixes to contig/chromosome names in references
  genigvjs:             Generate a simple IGV.js visualisation of some bioinf files.
  ildemux:              Demultiplex modern illumina reads from read headers.
  ilsample:             Sample a fraction of read pairs from an interleaved fastq file
  regionbed:            Make a bed/region file of genome windows
  uniref-acc2taxid:     Make a ncbi-style acc2taxid.map file for a uniref fasta
  nstitch:              Combine R1 + R2 into single sequences, with an N in the middle
  gg2k:                 Summarise a table with GreenGenes-style lineages into a kraken-style report.
  equalbestblast:       Output only the best blast hits.
  tabcat:               Concatenate table (c/tsv) files, adding the filename as a column
  esearchandfetch:      Use the Entrez API to search for and download something. A CLI companion to the NCBI search box
  deepclust2fa:         Split a .faa by the clusters diamond deepclust finds
  farename:             Rename sequences in a fasta file sequentially
  gffcat:               Concatenate GFF3 files, resepcting header lines and FASTA sections
  gffparse:             Format a GFF sanely
  gffcsqify:            Format a reasonably compliant GFF for use with bcftools csq
  gfftagsane:           Sanitise a messy gff attribute column to just simple tags 
  liftoff-gff3:         Obtain an actually-useful GFF3 from Liftoff by fixing basic GFF3 format errors
  ebiosra2rl2s:         INTERNAL: MPI Tübingen tool. Make a runlib-to-sample map table from ebio sra files
  galhist:              Make a summary histogram of git-annex-list output
  pairslash:            Add an old-style /1 /2 pair indicator to paired-end fastq files
  vcfstats:             Use bcftools to calculate various statistics, outputing an R-ready table
  vcfparallel:          Parallelise a bcf processing pipeline across regions
  shannon-entropy:      Calculate Shannon's entropy (in bits) at each column of one or more alignments
  fastasanitiser:       Sanitise fasta IDs to something sane, then back again
  tidyqc:               What if MultiQC was in the tidyverse? (and much worse)
  jsonl2csv:            Parse jsonlines into a C/TSV
  help:                 Print this help message


Use blsl subtool --help to get help about a specific tool

为什么叫Blindschleiche?

  1. 它们是了不起的动物
  2. 它们的英文名字是Slow Worm,这个名字很贴切,因为这些Python中的低性能工具。
  3. 所有用Python实现的工具都必须用蛇的玩笑命名,它们有点像蛇(实际上不是,它们是无足蜥蜴)

项目详情


下载文件

下载适用于您的平台文件。如果您不确定选择哪个,请了解有关安装软件包的更多信息。

源分布

blindschleiche-0.3.1.tar.gz (33.1 kB 查看哈希值)

上传时间:

构建分布

blindschleiche-0.3.1-py3-none-any.whl (47.0 kB 查看哈希值)

上传于 Python 3

支持者