| data:index [2011/01/20 19:29] – sillitoe | data:index [2017/10/14 15:28] (current) – sillitoe | ||
|---|---|---|---|
| + | |||
| + | ====== CATH Data Downloads ====== | ||
| + | |||
| + | This page provides information on the data files that are available to download from the [[ftp:// | ||
| + | |||
| + | See [[: | ||
| + | |||
| + | ===== CATH (daily snapshot) ===== | ||
| + | |||
| + | ftp:// | ||
| + | |||
| + | ^ File name ^ Description ^ | ||
| + | | cath-b-newest-all.gz | Lists the latest domain boundaries and superfamily (C.A.T.H) annotations for all CATH domains | | ||
| + | | cath-b-newest-names.gz | Provides the names for each node in the CATH hierarchy | | ||
| + | | cath-b-newest-latest-release.gz | Lists the latest domain boundaries and superfamily annotations for CATH domains in the most recent release of CATH-Plus | | ||
| + | | cath-b-newest-putative.gz | Lists the latest domain boundaries and superfamily annotations for CATH domains released since the most recent release of CATH-Plus | | ||
| + | | cath-b-s35-newest.gz | Lists the latest domain boundaries and sequence family (C.A.T.H.S) annotations for all non-redundant sequence representatives | | ||
| + | |||
| + | ===== CATH-Plus (full release) ===== | ||
| + | |||
| + | ftp:// | ||
| + | |||
| + | For information on the statistics for specific releases, see [[../ | ||
| + | |||
| + | ==== CATH classification data ==== | ||
| + | |||
| + | ftp:// | ||
| + | |||
| + | ^ File name ^ Description ^ | ||
| + | | cath-chain-list-< | ||
| + | | cath-domain-boundaries-*-< | ||
| + | | cath-domain-description-file-< | ||
| + | | cath-domain-list-< | ||
| + | | cath-domain-pdb-*-< | ||
| + | | cath-names-< | ||
| + | | cath-superfamily-list-< | ||
| + | | cath-unclassified-list-< | ||
| + | |||
| + | ==== Non-redundant data sets ==== | ||
| + | |||
| + | ftp:// | ||
| + | |||
| + | ^ File name ^ Description ^ | ||
| + | | cath-dataset-nonredundant-S[20%%|%%40].atom.fa | The ATOM sequences of the domains in the dataset (which only contain residues that have ATOM records in the PDB file) | | ||
| + | | cath-dataset-nonredundant-S[20%%|%%40].fa | The sequences of the domains in the dataset | | ||
| + | | cath-dataset-nonredundant-S[20%%|%%40].list | A list of the domains in the dataset; one domain ID per line | | ||
| + | | cath-dataset-nonredundant-S[20%%|%%40].pdb.tgz | A gzipped tar file containing the PDB files of the domains in the dataset | | ||
| + | |||
| + | ==== Sequence data ==== | ||
| + | |||
| + | ftp:// | ||
| + | |||
| + | ^ File name ^ Description ^ | ||
| + | | cath-domain-seqs-*-< | ||
| + | | cath-S35-< | ||
| + | | funfam-hmm3-< | ||
| + | | cath-superfamily-seqs-< | ||
| + | |||
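
The tables above describe `.list` files with one domain ID per line, FASTA (`.fa`) sequence files, and gzip-compressed (`.gz`) files. As a rough illustration of how a downloaded copy might be read, here is a minimal Python sketch. The helper names (`open_text`, `read_domain_ids`, `read_fasta`) are hypothetical and not part of any CATH tooling, and the sketch assumes the `.fa` files follow the standard FASTA layout (a `>` header line followed by sequence lines). The header text is kept as-is because its exact layout is not described on this page.

```python
import gzip
from pathlib import Path


def open_text(path):
    """Open a plain or gzip-compressed text file for reading (hypothetical helper)."""
    path = Path(path)
    if path.suffix == ".gz":
        return gzip.open(path, "rt", encoding="utf-8")
    return open(path, "r", encoding="utf-8")


def read_domain_ids(path):
    """Read a .list file: one domain ID per line, blank lines skipped."""
    with open_text(path) as handle:
        return [line.strip() for line in handle if line.strip()]


def read_fasta(path):
    """Parse a FASTA file into a dict mapping header text to sequence.

    Assumes standard FASTA layout; the header is stored without the
    leading '>' and is not interpreted any further.
    """
    sequences = {}
    header = None
    chunks = []
    with open_text(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # A new record starts: store the previous one first.
                if header is not None:
                    sequences[header] = "".join(chunks)
                header = line[1:]
                chunks = []
            elif header is not None:
                chunks.append(line)
    # Store the final record, if any.
    if header is not None:
        sequences[header] = "".join(chunks)
    return sequences
```

With local copies of the files, a call such as `read_domain_ids("cath-dataset-nonredundant-S40.list")` would return the domain IDs as a list, and `read_fasta("cath-dataset-nonredundant-S40.fa")` would return the sequences keyed by header. Both file names are taken from the table patterns above; the local paths are assumptions.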