Package: seqinr 4.2-44

Simon Penel

seqinr: Biological Sequences Retrieval and Analysis

Exploratory data analysis and data visualization for biological sequence (DNA and protein) data. Seqinr includes utilities for sequence data management under the ACNUC system described in Gouy, M. et al. (1984) Nucleic Acids Res. 12:121-127 <doi:10.1093/nar/12.1Part1.121>.

Authors:Delphine Charif [aut], Olivier Clerc [ctb], Carolin Frank [ctb], Jean R. Lobry [aut, cph], Anamaria Necşulea [ctb], Leonor Palmeira [ctb], Simon Penel [cre], Guy Perrière [ctb]

seqinr_4.2-44.tar.gz
seqinr_4.2-44.zip(r-4.7)seqinr_4.2-44.zip(r-4.6)seqinr_4.2-44.zip(r-4.5)
seqinr_4.2-44.tgz(r-4.6-x86_64)seqinr_4.2-44.tgz(r-4.6-arm64)seqinr_4.2-44.tgz(r-4.5-x86_64)seqinr_4.2-44.tgz(r-4.5-arm64)
seqinr_4.2-44.tar.gz(r-4.7-arm64)seqinr_4.2-44.tar.gz(r-4.7-x86_64)seqinr_4.2-44.tar.gz(r-4.6-arm64)seqinr_4.2-44.tar.gz(r-4.6-x86_64)
seqinr_4.2-44.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
seqinr/json (API)
NEWS

# Install 'seqinr' in R:
install.packages('seqinr', repos = c('https://lbbe-software.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/lbbe-software/seqinr/issues

Uses libs:
  • zlib– Compression library
Datasets:
  • aacost - Aerobic cost of amino-acids in Escherichia coli and G+C classes
  • aaindex - List of 544 physicochemical and biological properties for the 20 amino-acids
  • AnoukResult - Expected numeric results for Ka and Ks computation
  • caitab - Codon Adaptation Index (CAI) w tables
  • chargaff - Base composition in ssDNA for 7 bacterial DNA
  • clustal - Example of results obtained after a call to read.alignment
  • dinucl - Mean zscore on 242 complete bacterial chromosomes
  • ec999 - 999 coding sequences from E. coli
  • ECH - Forensic Genetic Profile Allelic Ladder Raw Data
  • EXP - Vectors of coefficients to compute linear forms.
  • fasta - Example of results obtained after a call to read.alignment
  • gcO2 - GC content and aerobiosis in bacteria
  • gcT - GC content and temperature in bacteria
  • gs500liz - GS500LIZ size standards
  • identifiler - Identifiler allele names
  • JLO - Forensic Genetic Profile Raw Data
  • kaksTorture - Expected numeric results for Ka and Ks in extreme cases
  • m16j - Fragment of the E. coli chromosome
  • mase - Example of results obtained after a call to read.alignment
  • msf - Example of results obtained after a call to read.alignment
  • phylip - Example of results obtained after a call to read.alignment
  • pK - PK values for the side chain of charged amino acids from various sources
  • prochlo - Zscore on three strains of Prochlorococcus marinus
  • revaligntest - Three aligned nucleic acid sequences
  • SEQINR.UTIL - Utility data for seqinr
  • SEQINR.UTIL - Utility data for seqinr
  • toyaa - A toy example of amino-acid counts in three proteins
  • toycodon - A toy example of codon counts in three coding sequences
  • waterabs - Light absorption by the water column

On CRAN:

Conda:

zlib

11.79 score 5 stars 128 packages 4.9k scripts 33k downloads 110 mentions 204 exports 9 dependencies

Last updated from:c84024d1dc. Checks:13 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-arm64OK150
linux-devel-x86_64OK158
source / vignettesOK186
linux-release-arm64OK144
linux-release-x86_64OK141
macos-release-arm64OK102
macos-release-x86_64OK231
macos-oldrel-arm64OK135
macos-oldrel-x86_64OK191
windows-develOK138
windows-releaseOK122
windows-oldrelOK120
wasm-releaseOK121

Exports:.seqinrEnvaaaaAAstatacnuccloseacnucopenal2bpalllistranksalrambas.alignmentas.matrix.alignmentas.SeqAcnucWebas.SeqFastaAAas.SeqFastadnaas.SeqFragautosocketbaselineabifbmac2scaicflchoosebankcircleclfcdclientidclosebankcol2alphacompcomputePIconconsensuscountcountfreelistscountsubseqscrelistfromclientdatacssdia.bactgensizedia.db.growthdist.alignmentdotchart.ucodotPlotdraw.orilocdraw.rearranged.orilocdraw.recstatexseqextract.breakpointsextractseqsfastaccgb2fastagbk2g2gbk2g2.eukGCGC1GC2GC3GCposget.db.growthgetAnnotgetAnnot.defaultgetAnnot.listgetAnnot.logicalgetAnnot.qawgetAnnot.SeqAcnucWebgetAnnot.SeqFastaAAgetAnnot.SeqFastadnagetAttributsocketgetFraggetFrag.charactergetFrag.defaultgetFrag.listgetFrag.logicalgetFrag.qawgetFrag.SeqAcnucWebgetFrag.SeqFastaAAgetFrag.SeqFastadnagetFrag.SeqFraggetKeywordgetKeyword.defaultgetKeyword.listgetKeyword.logicalgetKeyword.qawgetKeyword.SeqAcnucWebgetLengthgetLength.charactergetLength.defaultgetLength.listgetLength.logicalgetLength.qawgetLength.SeqAcnucWebgetLength.SeqFastaAAgetLength.SeqFastadnagetLength.SeqFraggetlistrankgetliststategetLocationgetLocation.defaultgetLocation.listgetLocation.logicalgetLocation.qawgetLocation.SeqAcnucWebgetNamegetName.defaultgetName.listgetName.logicalgetName.qawgetName.SeqAcnucWebgetName.SeqFastaAAgetName.SeqFastadnagetName.SeqFraggetNumber.socketgetSequencegetSequence.charactergetSequence.defaultgetSequence.listgetSequence.logicalgetSequence.qawgetSequence.SeqAcnucWebgetSequence.SeqFastaAAgetSequence.SeqFastadnagetSequence.SeqFraggetTransgetTrans.charactergetTrans.defaultgetTrans.listgetTrans.logicalgetTrans.qawgetTrans.SeqAcnucWebgetTrans.SeqFastadnagetTrans.SeqFraggetTypegfragghelpglnglrglsis.SeqAcnucWebis.SeqFastaAAis.SeqFastadnais.SeqFragisenumisnkakskdbknowndbslseqinrmodifylistmovemvn2sorilocparser.socketpeakabifpermutationpgaplot.SeqAcnucWebplotabifplotladderplotPanelspmwprepgetannotsprettyseqprint.qawprint.SeqAcnucWebqueryquitacnucread.abifread.alignmentread.fastareadBinsreadfirstrecreadPanelsreadsmjrearranged.orilocrecstatresiduecountreverse.alignrhorot13s2cs2nsavelistSEQINR.UTILsetlistnamesplitseqstrescstutterabifsummary.SeqFastaAAsummary.SeqFastadnaswapsyncodonssynsequencetablecodetest.co.recstattest.li.recstattranslatetrimSpaceucoucoweightwhere.is.this.accwordswords.poswrite.fastazscore

Dependencies:ade4latticeMASSnlmepixmapRcppRcppArmadillosegmentedsp

Readme and manuals

Help Manual

Help pageTopics
Biological Sequences Retrieval and Analysisseqinr-package seqinr
Converts amino-acid three-letter code into the one-letter onea
Converts amino-acid one-letter code into the three-letter oneaaa
Aerobic cost of amino-acids in Escherichia coli and G+C classesaacost
List of 544 physicochemical and biological properties for the 20 amino-acidsaaindex
To Get Some Protein StatisticsAAstat
open and close a remote access to an ACNUC databaseacnucclose acnucopen clientid quitacnuc
To Convert a forensic microsatellite allele name into its length in base pairsal2bp
To get the count of existing lists and all their ranks on serveralllistranks alr
Expansion of IUPAC nucleotide symbolsamb
Expected numeric results for Ka and Ks computationAnoukResult
Constructor for class alignmentas.alignment
as.matrix.alignmentas.matrix.alignment
Returns a socket to the last opened databaseautosocket
Estimation of baseline valuebaselineabif
Computing an IUPAC nucleotide symbolbma
conversion of a vector of chars into a stringc2s
Codon Adaptation Indexcai
Codon Adaptation Index (CAI) w tablescaitab
Base composition in ssDNA for 7 bacterial DNAchargaff
To select a database structured under ACNUC and located on the web.seqinrEnv choosebank
Draws a circlecircle
To close a remote ACNUC databaseclosebank
Example of results obtained after a call to read.alignmentclustal
To use a standard color with an alpha transparency chanelcol2alpha
complements a nucleic acid sequencecomp
To Compute the Theoretical Isoelectric PointcomputePI
Consensus and profiles for sequence alignmentscon consensus
Composition of dimer/trimer/etc oligomerscount
The number of free lists available and annotation lines in an ACNUC servercfl countfreelists
Number of subsequences in an ACNUC listcountsubseqs css
To create on server an ACNUC list from data lines sent by clientclfcd crelistfromclientdata
Distribution of bacterial genome size from GOLDdia.bactgensize
Mean zscore on 242 complete bacterial chromosomesdinucl
Statistical over- and under- representation of dinucleotides in a sequencerho zscore
Pairwise Distances from Aligned Protein or DNA/RNA Sequencesdist.alignment
Cleveland plot for codon usage tablesdotchart.uco
Dot Plot Comparison of two sequencesdotPlot
Graphical representation for nucleotide skews in prokaryotic chromosomes.draw.oriloc
Graphical representation for rearranged nucleotide skews in prokaryotic chromosomes.draw.rearranged.oriloc
Graphical representation of a recstat analysis.draw.recstat
999 coding sequences from E. coliec999
Forensic Genetic Profile Allelic Ladder Raw DataECH
Vectors of coefficients to compute linear forms.EXP
Extraction of breakpoint positions on the rearranged nucleotide skews.extract.breakpoints
To extract the sequences information of a sequence or a list of sequence in different formatsexseq extractseqs
Example of results obtained after a call to read.alignmentfasta
Fast Allele in Common Countfastacc
Calculates the fractional G+C content of nucleic acid sequences.GC GC1 GC2 GC3 GCpos
Conversion of GenBank file into fasta filegb2fasta
Conversion of a GenBank format file into a glimmer-like onegbk2g2
Conversion of a GenBank format file into a glimmer-like one. Eukaryotic version.gbk2g2.euk
GC content and aerobiosis in bacteriagcO2
GC content and temperature in bacteriagcT
Get the exponential growth of nucleic acid database contentdia.db.growth get.db.growth
Generic Function to get sequence annotationsgetAnnot getAnnot.default getAnnot.list getAnnot.logical getAnnot.qaw getAnnot.SeqAcnucWeb getAnnot.SeqFastaAA getAnnot.SeqFastadna readAnnots.socket
Generic function to extract sequence fragmentsgetFrag getFrag.character getFrag.default getFrag.list getFrag.logical getFrag.qaw getFrag.SeqAcnucWeb getFrag.SeqFastaAA getFrag.SeqFastadna getFrag.SeqFrag
Generic function to get keywords associated to sequencesgetKeyword getKeyword.default getKeyword.list getKeyword.logical getKeyword.qaw getKeyword.SeqAcnucWeb
Generic function to get the length of sequencesgetLength getLength.character getLength.default getLength.list getLength.logical getLength.qaw getLength.SeqAcnucWeb getLength.SeqFastaAA getLength.SeqFastadna getLength.SeqFrag
To get the rank of a list from its namegetlistrank glr
Asks for information about an ACNUC list of specified rankgetliststate gln gls
Generic function to get the location of subsequences on the parent sequencegetLocation getLocation.default getLocation.list getLocation.logical getLocation.qaw getLocation.SeqAcnucWeb
Generic function to get the names of sequencesgetName getName.default getName.list getName.logical getName.qaw getName.SeqAcnucWeb getName.SeqFastaAA getName.SeqFastadna getName.SeqFrag
Generic function to get sequence datagetSequence getSequence.character getSequence.default getSequence.list getSequence.logical getSequence.qaw getSequence.SeqAcnucWeb getSequence.SeqFastaAA getSequence.SeqFastadna getSequence.SeqFrag
Generic function to translate coding sequences into proteinsgetTrans getTrans.character getTrans.default getTrans.list getTrans.logical getTrans.qaw getTrans.SeqAcnucWeb getTrans.SeqFastadna getTrans.SeqFrag
To get available subsequence types in an opened ACNUC databasegetType
Extract sequence identified by name or by number from an ACNUC servergfrag
Get help from an ACNUC serverghelp
GS500LIZ size standardsgs500liz
Identifiler allele namesidentifiler
Get the ACNUC number of a sequence from its name or accession numbergetAttributsocket getNumber.socket isenum isn
Forensic Genetic Profile Raw DataJLO
Ka and Ks, also known as dn and ds, computationkaks
Expected numeric results for Ka and Ks in extreme caseskaksTorture
Description of databases known by an ACNUC serverkdb knowndbs
To see what's inside the package seqinrlseqinr
Fragment of the E. coli chromosomem16j
Example of results obtained after a call to read.alignmentmase
Modification of an ACNUC listmodifylist
Rename an R objectmove mv
Example of results obtained after a call to read.alignmentmsf
function to convert the numeric encoding of a DNA sequence into a vector of charactersn2s
Prediction of origin and terminus of replication in bacteria.oriloc
Utility function to parse answers from an ACNUC serverparser.socket
Extraction of Peak locations, Heights and Surfaces from ABIF datapeakabif
Sequence permutation according to several different modelspermutation
Example of results obtained after a call to read.alignmentphylip
pK values for the side chain of charged amino acids from various sourcespK
To Plot Subsequences on the Parent Sequenceplot.SeqAcnucWeb
Electrophoregram plot for ABIF dataplotabif
Simple plot of an allelic ladder from ABIF dataplotladder
Representation of Amplicon Size Ranges of a STR kit.plotPanels
Protein Molecular Weightpmw
Select annotation lines in an ACNUC databasepga prepgetannots
Text representation of a sequence from an ACNUC serverprettyseq
Print method for objects from class qawprint.qaw
Print method for objects from class SeqAcnucWebprint.SeqAcnucWeb
Zscore on three strains of Prochlorococcus marinusprochlo
To get a list of sequence names from an ACNUC data base located on the webquery
Read ABIF formatted filesread.abif
Read aligned sequence files in mase, clustal, phylip, fasta or msf formatread.alignment
read FASTA formatted filesFASTA read.fasta readfasta
Import GenMapper Bins configuration filereadBins
Low level function to get the record count of the specified ACNUC index filereadfirstrec
Import GenMapper Panels configuration filereadPanels
Low level function to read ACNUC SMJYT index filesreadsmj
Detection of replication-associated effects on base composition asymmetry in prokaryotic chromosomes.rearranged.oriloc
Prediction of Coding DNA Sequences.recstat
Total number of residues in an ACNUC listresiduecount
Three aligned nucleic acid sequencesrevaligntest
Reverse alignment - from protein sequence alignment to nucleic sequence alignmentreverse.align
Ergheaf gur EBG-13 pvcurevat bs n fgevatrot13
conversion of a string into a vector of charss2c
simple numerical encoding of a DNA sequence.s2n
Save sequence names or accession numbers into a filesavelist
Sequence coming from a remote ACNUC data baseas.SeqAcnucWeb is.SeqAcnucWeb SeqAcnucWeb
AA sequence in Fasta Formatas.SeqFastaAA is.SeqFastaAA SeqFastaAA summary.SeqFastaAA
Class for DNA sequence in Fasta Formatas.SeqFastadna is.SeqFastadna SeqFastadna summary.SeqFastadna
Class for sub-sequencesas.SeqFrag is.SeqFrag SeqFrag
utility data for seqinrSEQINR.UTIL
Sets the name of an ACNUC list identified by its ranksetlistname
split a sequence into sub-sequencessplitseq
Utility function to escape LaTeX special characters present in a stringstresc
Stutter ratio estimationstutterabif
Exchange two R objectsswap
Synonymous codonssyncodons
Random synonymous coding sequence generationsynsequence
to plot genetic code as in textbookstablecode
Tests if regions located between Stop codons contain putative CDSs.test.co.recstat
Tests if regions located between Stop codons contain putative CDSs.test.li.recstat
A toy example of amino-acid counts in three proteinstoyaa
A toy example of codon counts in three coding sequencestoycodon
Translate nucleic acid sequences into proteinstranslate
Trim leading and/or trailing spaces in stringstrimSpace
Codon usage indicesrscu uco
Weight of each synonymous codonucoweight
Light absorption by the water columnwaterabs
Scans databases for a given sequence accession numberwhere.is.this.acc
To get all words from an alphabet.words
Positions of possibly degenerated motifs within sequenceswords.pos
Write sequence(s) into a file in fasta formatwrite.fasta