spacer

InterPro: Home

InterPro is a database of protein families, domains, repeats and sites in which identifiable features found in known proteins can be applied to new protein sequences.

Release News

Announcement:
  • InterPro 18.0 is released and covers 75.6% of UniProtKB, with new methods from PROSITE, GENE3D and SUPERFAMILY.
  • PROSITE pattern matches are now evaluated to either TRUE (T) or UNKNOWN (?) using miniprofiles or associated existing PROSITE profiles.

Please see Release Notes for further details.

General Information:

  • Match_complete.xml (UniProtKB) now contains all UniProtKB proteins including those not matching an InterPro signature.
  • UniParc (uniparc_match.tar.gz) and UniMES (unimes_match.tar.gz) matches to InterPro methods have been updated and are available from the ftp site in XML format.

Note: due to the large size of UniParc and UniMES the data has been divided into chunks and the latest updates are provided in these files at each InterPro release.

Future proposed changes:

InterPro will be introducing new entry classification rules that will affect how an entry is typed:

  1. Entries typed Repeat or Site will remain the same.
  2. Entries typed Family or Domain will follow stricter criteria to ensure they conform more closely to current biological concepts:
    • Entries typed Family will contain signatures that cover all domains in the matching proteins.
    • Entries typed Domain will identify biological units with defined boundaries, which includes structural domains/subdomains as well as functional domains.
    • All remaining entries will be covered by a new type, Region including those which cover more than one domain, as well as those covering partial domain(s).
  3. New relationship rules will be introduced that will affect how different entries are related to one another. Parent/Child and Contains/Found in relationships will continue within InterPro with their existing definitions, but the following changes will occur:
    • Entry type will no longer have any bearing on the relationships of that entry. Instead, only the sequence covered by the signatures of an entry will be taken into consideration when forming relationships.
    • Parent/Child relationships will be permitted between entries of different types.
    • All Contains/Found In relationships for an entry will be displayed in the Relationships section of an entry (currently, only the most specific are displayed).

Any concerns or comments regarding the proposed changes should be directed to EBI Support.

User support and feedback

We welcome feedback, particularly if you find errors or omissions please let us know. If you need information or help, have any comments and/or suggestions on the InterPro database, please contact us at EBI Support.

InterPro Funding

Current InterPro Funding:

Impact/E infrastructure logo

InterPro is currently funded by grant number 213037 from the European Union under the program "FP7 capacities: Scientific Data Repositories". The working title for the project is IMproving Protein Annotation and Co-ordination using Technology (IMPACT).

InterPro is also funded by grant BB/F010508/1 from the BBSRC Bioinformatics and Biological Resources Fund.

Previous InterPro Funding:

InterPro was funded by the award of grant number QLRI-CT-2000-00517 and in part by grant number QLRI-CT-2001000015 from the European Union under the RTD program "Quality of Life and Management of Living Resources". InterPro was also part of the MRC-funded eFamily project.

spacer