Skip to content

furthlab/NPC-proteomics

Repository files navigation

Human Nuclear Pore Complex (NPC) Protein FASTA

7R5K PDB structure

UniProt Accession Protein name Gene
P37198 Nuclear pore glycoprotein p62 (Nup62) NUP62
O00410 Nucleoporin Nup58 NUP58
P46781 Nucleoporin Nup54 NUP54
P52948 Nucleoporin Nup98 / Nup96 precursor NUP98
P57740 Nuclear pore complex protein Nup107 NUP107
Q8WUM0 Nuclear pore complex protein Nup133 NUP133
Q12769 Nuclear pore complex protein Nup160 NUP160
Q9BW27 Nuclear pore complex protein Nup85 NUP85
Q8NFH4 Nucleoporin Nup37 NUP37
Q8NFH3 Nucleoporin Nup43 NUP43
Q96DB2 Nucleoporin Seh1 SEH1
Q96B26 Nucleoporin Sec13 SEC13
O75694 Nuclear pore complex protein Nup155 NUP155
Q92621 Nuclear pore complex protein Nup205 NUP205
Q8WVZ8 Nuclear pore complex protein Nup188 NUP188
H3BMX0 Nuclear pore complex protein Nup93 (fragment) NUP93
Q8NFH5 Nucleoporin Nup35 NUP35
A8K3Z5 Nucleoporin Nup53 NUP53
P57737 Nuclear pore complex protein Nup50 NUP50
P49790 Nuclear pore complex protein Nup153 NUP153
Q9Y6X5 Nuclear pore complex protein TPR TPR
P35658 Nuclear pore complex protein Nup214 NUP214
Q99567 Nuclear pore complex protein Nup88 NUP88
Q8TEM1 Nuclear pore membrane glycoprotein 210 NUP210
Q9H3G5 Nuclear division cycle protein 1 (NDC1) NDC1
O75696 Nuclear pore membrane protein POM121 POM121
O95625 WD-repeat nucleoporin ALADIN AAAS

Overview

This directory contains a FASTA file (human_NPC.fasta) comprising canonical sequences of human nuclear pore complex (NPC) proteins, also called nucleoporins (NUPs).

Sequences were retrieved from the UniProt human reference proteome (UniProtKB) using the UniProt REST API.

Due to query limits on the UniProt /stream endpoint, sequences were fetched in small batches and merged into a single FASTA file.


Definition of “Nuclear Pore Complex Protein”

NPC proteins here are core structural nucleoporins, excluding transient transport receptors (e.g., importins/exportins).

Included representatives cover all major NPC substructures:

  • Central channel FG-nucleoporins
  • Outer ring (Nup107–160 complex)
  • Inner ring scaffold
  • Nuclear basket
  • Cytoplasmic filaments
  • Pore membrane (transmembrane) nucleoporins

This corresponds to the canonical ~30 nucleoporins in vertebrates.


NPC Protein List (UniProt Accessions)

Category Gene UniProt Accession
Central channel NUP62 P37198
NUP58 O00410
NUP54 P46781
NUP98 / NUP96 (precursor) P52948
Outer ring (Nup107–160) NUP107 P57740
NUP133 Q8WUM0
NUP160 Q12769
NUP85 Q9BW27
NUP37 Q8NFH4
NUP43 Q8NFH3
SEH1 Q96DB2
SEC13 Q96B26
Inner ring scaffold NUP155 O75694
NUP205 Q92621
NUP188 Q8WVZ8
NUP93 H3BMX0
NUP35 Q8NFH5
NUP53 A8K3Z5
Nuclear basket NUP50 P57737
NUP153 P49790
TPR Q9Y6X5
Cytoplasmic filaments NUP214 P35658
NUP88 Q99567
Pore membrane NUP210 Q8TEM1
NDC1 Q9H3G5
POM121 O75696
WD-repeat NUP AAAS (ALADIN) O95625

Total expected sequences: ~30
(NUP98 and NUP96 are derived from a single precursor UniProt entry P52948)


Data Retrieval Method

Why batching was required

  • UniProt /stream requires a query parameter even for known accessions
  • Long queries can silently fail or return empty results
  • Querying in smaller batches avoids these issues

Batches of ≤7 proteins were queried using explicit accession:X OR accession:Y strings, then merged after validation.

Script used

The file fetch_human_NPC.sh:

  • Fetches all NPC proteins in batches
  • Validates each batch contains FASTA entries
  • Concatenates results into human_NPC.fasta

Output

  • File: human_NPC.fasta
  • Content: Canonical UniProt sequences (no isoforms)
  • Species: Homo sapiens
  • Source: UniProtKB (Swiss-Prot preferred)
grep -c "^>" human_NPC.fasta

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors