PIR Peptide Match allows users to quickly retrieve all occurrences for a given query peptide from the UniProtKB protein sequences.
The matched results are tabulated and can be narrowed down to a specific set of organisms by browsing the
taxonomy tree or taxonomy group. We also provide web interface for multiple peptide match with summary
statistics for matched taxonomy and organisms, programmtic access via RESTful web services and a standalone
Java command-line software package that allows users to integrate the Peptide Match Services to their local
pipeline for proteomics data analysis.
If you want to link your peptide sequence directly to our search engine, please use the following url:
http://proteininformationresource.org/cgi-bin/peptidematch?peptide=XXXX (e.g. http://proteininformationresource.org/cgi-bin/peptidematch?peptide=AAVEEGIVLGGGCALLR).
User can find an exact match for a peptide sequence query in the selected database. The search can be performed
against:
UniProtKB, which is the central hub for the collection of functional information on proteins, with accurate, consistent,
and rich annotation. It consists of two sections: a section containing manually-annotated records with
information extracted from literature and curator-evaluated computational analysis (UniProtKB/Swiss-Prot),
and a section with computationally analyzed records that await full manual annotation (UniProtKB/TrEMBL).
A subset of UniProtKB entries belonging to a certain organism which is a complete proteome.
A subset of UniProtKB entries beloing to a set of organisms which are reference proteomes.
The main user interface for single peptide match is shown below. The first section is mainly for selecting
database which will be used in the search. The second section is for user to input the peptide sequence.
The "?" on the upper-right corner is a link to detailed user guide.
By default, the entire UniProtKB will be searched. However, if a user is only interested in a certain organism
or a set of organsims, only a subset of the UniProtKB entries will be searched. For example, if a user
is interested in an organism in the UniProt
complete proteome set, A complete proteome consists of a set of proteins of organism whose genome
has been completely sequenced, the second option allows user to specify the organism by either entering
the organism name or corresponding NCBI taxonomy ID as shown below.
If a user is interesed in a larger set of organisms and want to select multiple organisms to search against, the third option
alphabetically organizes the names of UniProt
reference proteome set. A reference proteome is the complete proteome of well-studied model organism
or organism related to biomedical research. The selected organisms are tracked while user is making multiple
selections as shown below.
Once the search is done, the matched results will be displayed as table shown below.
The table is organized into 9 sections:
1 - Query Peptide Sequence Show the query peptide sequence as well as the sequence data set, search options selected by the user.
2 - Summary of Matched Organisms The maxium top 5 most matched organisms and the rest of organisms are shown as pie chart with the
information of the number and percent of proteins in each category. Click the count number in each
organism will show the matched proteins of that organism. The "Others" link links to the paginated
table of all the matched organisms and corresponding matched proteins as shown in section 6.
3- Summary of Matched Taxonomy Groups The maxium top 5 most matched taxonomy groups and the rest of taxonomy groups are shown as pie chart
with the information of the number and percent of proteins in each category. Click the count number
in each taxononmy group will show the matched proteins of that taxonomy group. "Others" link links
to the taxononmy group view as shown in section 7.
4 - Paginated matched proteins Show the pagination information of the matched proteins and allow users to navigate to different pages.
5 - Browse by organism Click to view the matched proteins organized by organisms as shown below.
6 - Browse by taxonomic group Click to view the taxonomy groups of matched proteins as shown below.
On the top is information about the query peptide sequence user entered, the matched organisms and the
total number of protein sequences matched. Then the matched results are divided into 4 tables: Archaea,
Bacteria, Eukaryota and Others. In each table, the first column is the taxonomy groups with the links
to NCBI taxonomy. The second column is the number of matched protein entries and detailed view of those
matched proteins. The third column is the % of matched proteins in its corresponding taxonomy group.
7 - Browse by taxonomic tree Click to view the taxonomy tree of matched proteins as shown below. Each tree node includes the taxonomy
name with links to NCBI taxonomy, the rank of taxonomy if it does exist, and finally the number of
proteins whose lineage contain this taxonomy. The number links to a tab view of those proteins.
8 - Save current table view and do analysis: BLAST, FASTA, Pattern Match, Multiple Alignment and Domain
Display
The current view of the result table can be saved to the user's local computer. The results will
be saved for selected entries or, if no proteins are selected, for all entries. Clicking "Table" will
save the displayed columns as a Tab-delimited text file, which may be imported into a spreadsheet
for easier viewing or analysis. Clicking "FASTA" will save the IDs and sequences in FASTA format:
User can selected matched protein entries for further analyzed using the sequence analysis programs available in the Results
page. First, select the protein(s) using the checkboxes on the left side of the table, then click the
corresponding analysis tool.
Click "BLAST" or "FASTA" button, and a new query page will be displayed, along with the parameters
that were selected in the initial search.
Click "Pattern Match" to search against the PROSITE database.
For multiple alignment, check at least 2 proteins (but no more than 70), then click the "Multiple Alignment"
button. This will open the Multiple alignment form from which you can select one of the alignment
programs: ClustalW, T-Coffee or Muscle. The result page will display the alignment and alignment viewer.
For ClustalW and T-Coffee, the neighbor-joining tree and alignment can be viewed, edited and saved
using either PIR-TAV viewer or JalView. For Muscle only Jalview is available.
9 - Results Table The matched proteins are displayed in a table with the following columns:
Protein AC/ID The Protein AC/ID refers to the UniProtKB identifiers. Below these numbers, user may choose either
the iProClass or the UniProtKB view of each protein entry. The source of the UniProtKB sequence is
shown as UniProtKB/Swiss-Prot or UniProtKB/TrEMBL if the protein sequence is from Swiss-Prot or TrEMBL,
respectively.
If User specified that the search should be limited to UniRef100 representative sequence, then "
UniRef100 Cluster ID" and corresponding "
Representative Protein AC" will be displayed.
Protein Name The Common name given to a protein, that identifies its function or specifies its features.
Length Number of amino acid residues in the matched protein.
Organism Name The genus and species of the source organism from which the sequence is originated. Links to NCBI
taxonomy information is provided.
Protein Links to Proteomics Databases If a protein can be found in NIST peptide library, PRIDE or PeptideAtlas databases, the column will
show the corresponding links to these external proteomics databases.
Immune Epitope Database Links to Immune Epitope Database and Analysis Resource.
Match Range This column displays in red is the query peptide within the sequence.
User can find the exact match for a set of peptide sequence queries in the selected database. The search
can be performed against:
UniProtKB, which is the central hub for the collection of functional information on proteins, with accurate, consistent,
and rich annotation. It consists of two sections: a section containing manually-annotated records with
information extracted from literature and curator-evaluated computational analysis (UniProtKB/Swiss-Prot),
and a section with computationally analyzed records that await full manual annotation (UniProtKB/TrEMBL).
A subset of UniProtKB entries belonging to a certain organism which is a complete proteome.
A subset of UniProtKB entries beloing to a set of organisms which are reference proteomes.
The batch peptide match results can be downloaed for further analysis.
The main user interface for multiple peptide match is shown below. It is very similar to the single peptide
match. The "Sequence data set selection" section is exactly the same as in Single Peptide Match as shown
above. The only difference is in "Input query peptides" section. We allow user to enter multiple peptide
sequences, one per line or put them together as a text file, one peptide per line and upload them.
Depending on the user inputs, the multiple peptide match may take a while to complete. Therefore, we provide
a progress report as shown below which shows the peptide sequence which is currently searching.
Once the whole batch job is successfully finished, the job id becomes a link, clicking it will direct
the user to select the types of match reports to download for further analysis.
The example match reports can be viewed from the links below:
In addition to the web interface to the Peptide Match web application, we also provide access to it via RESTful web services.
REST (or REpresentational State Transfer) refers to a style of building web services which makes it easy
to interact programmatically with the web site. A programmatic interface, also called an Application Programming
Interface (API) allows users to write scripts or programs to do peptide match, rather than having to rely
on a browser to do the search on the web site, which makes it more streamline with the downstream analysis.
Below is a list of services provided.
In general, user needs to provide the following parameters: the query peptide sequence(s), database ranges (either the whole
UniProtKB or a subset of organisms) and finally, the format of the results. Currently, we support tab,
xls, fasta, and xml formats as described below:
tab returns the matched protein(s) in a Tab-delimited format with the following information:
Protein acession number
Protein ID
Protein name
Protein sequence length
Organism name
Matched range(s) (maybe multiple, if the query peptide matches multiple locations in a protein sequence)
Proteomic databases
IEDB
xls returns the matched protein(s) with the same information as in
tab foramt for easy importing into Excel.
fasta returns the matched protein(s) in FASTA format: The Def line of each protein record in the fasta
file contains the following information separated by "^|^":
Protein acession number
Protein ID
Protein name
PIRSF ID
PIRSF name
Organism name
Organism taxonomy ID
Organism taxonomy group name
Organism taxonomy group ID
Matching peptides with their corresponding matched ranges
xml returns the full information of matched protein(s) in XML format: The XML schema can be found
here.
Example 1: get matches of a peptide sequence in UniProtKB
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative sequences,
not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all UniProtKB sequences plus isoforms, limited to UniRef100 representative
sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Example 2: get matches of a peptide in a set of organisms from UniProtKB
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms in UniProtKB, not
limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output
is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in XLS format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent,
output is in XML format:
Example 4: get matches of a list of peptides in UniProtKB
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in XLS format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XML
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in XLS format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, not limited
to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is
in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA
format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all UniProtKB sequences plus isoforms, limited to
UniRef100 representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XML
format:
Example 5: get matches of a list of peptides in a set of organisms from UniProtKB
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences plus isoforms
in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, not limited to UniRef100 representative sequences, not treat Leucine (L)
and Isoleucine (I) equivalent, output is in XML format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in Tab-delimited format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in XLS format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in FASTA format:
Searching "AVEEGIVLGGGCALLR" and "SVQYDDVPEYK" in all Homo sapiens (9606) and Mus musculus (10090) sequences
plus isoforms in UniProtKB, limited to UniRef100 representative sequences, treat Leucine (L) and Isoleucine
(I) equivalent, output is in XML format:
Example 6: get matches of a list of peptides in a set of organisms from UniProtKB using files as inputs
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms (taxonomy Ids listed
in "organismFile.txt") of UniProtKB sequences plus isoforms, not limited to UniRef100 representative
sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, not limited to UniRef100
representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, not limited to UniRef100
representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, not limited to UniRef100
representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, not limited to UniRef100
representative sequences, not treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, limited to UniRef100
representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, limited to UniRef100
representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in Tab-delimited
format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, limited to UniRef100
representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XLS format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, limited to UniRef100
representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in FASTA format:
Searching query peptides (one peptide per line, listed in the file "queryFile.txt") within the organisms
(taxonomy Ids listed in "organismFile.txt") of UniProtKB sequences plus isoforms, limited to UniRef100
representative sequences, treat Leucine (L) and Isoleucine (I) equivalent, output is in XML format: