5EG0 | pdb_00005eg0

HOXB13-MEIS1 heterodimer bound to DNA


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.10 Å
  • R-Value Free: 
    0.322 (Depositor), 0.320 (DCC) 
  • R-Value Work: 
    0.285 (Depositor), 0.300 (DCC) 
  • R-Value Observed: 
    0.288 (Depositor) 

Starting Model: experimental
View more details

wwPDB Validation   3D Report Full Report


This is version 1.2 of the entry. See complete history


Literature

DNA-guided transcription factor interactions extend human gene regulatory code.

Xie, Z.Sokolov, I.Osmala, M.Yue, X.Bower, G.Pett, J.P.Chen, Y.Wang, K.Cavga, A.D.Popov, A.Teichmann, S.A.Morgunova, E.Kvon, E.Z.Yin, Y.Taipale, J.

(2025) Nature 

  • DOI: https://doi.org/10.1038/s41586-025-08844-z
  • Primary Citation of Related Structures:  
    5EG0, 5NO6, 8BYX, 8BZM, 8R7F, 8R7Z

  • PubMed Abstract: 

    In the same way that the mRNA-binding specificities of transfer RNAs define the genetic code, the DNA-binding specificities of transcription factors (TFs) form the molecular basis of the gene regulatory code 1,2 . The human gene regulatory code is much more complex than the genetic code, in particular because there are more than 1,600 TFs that commonly interact with each other. TF-TF interactions are required for specifying cell fate and executing cell-type-specific transcriptional programs. Despite this, the landscape of interactions between DNA-bound TFs is poorly defined. Here we map the biochemical interactions between DNA-bound TFs using CAP-SELEX, a method that can simultaneously identify individual TF binding preferences, TF-TF interactions and the DNA sequences that are bound by the interacting complexes. A screen of more than 58,000 TF-TF pairs identified 2,198 interacting TF pairs, 1,329 of which preferentially bound to their motifs arranged in a distinct spacing and/or orientation. We also discovered 1,131 TF-TF composite motifs that were markedly different from the motifs of the individual TFs. In total, we estimate that the screen identified between 18% and 47% of all human TF-TF motifs. The novel composite motifs we found were enriched in cell-type-specific elements, active in vivo and more likely to be formed between developmentally co-expressed TFs. Furthermore, TFs that define embryonic axes commonly interacted with different TFs and bound to distinct motifs, explaining how TFs with a similar specificity can define distinct cell types along developmental axes.


  • Organizational Affiliation
    • State Key Laboratory of Cardiovascular Diseases and Medical Innovation Center, Shanghai East Hospital, School of Medicine, Tongji University, Shanghai, China.

Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Homeobox protein Meis257Homo sapiensMutation(s): 0 
Gene Names: MEIS2MRG1
UniProt & NIH Common Fund Data Resources
Find proteins for O14770 (Homo sapiens)
Explore O14770 
Go to UniProtKB:  O14770
PHAROS:  O14770
GTEx:  ENSG00000134138 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupO14770
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 4
MoleculeChains Sequence LengthOrganismDetailsImage
Homeobox protein Hox-B13D [auth B]61Homo sapiensMutation(s): 0 
Gene Names: HOXB13
UniProt & NIH Common Fund Data Resources
Find proteins for Q92826 (Homo sapiens)
Explore Q92826 
Go to UniProtKB:  Q92826
PHAROS:  Q92826
GTEx:  ENSG00000159184 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ92826
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 2
MoleculeChains LengthOrganismImage
DNA (5'-D(P*GP*TP*TP*GP*AP*CP*AP*GP*TP*TP*TP*TP*AP*CP*GP*AP*GP*G)-3')B [auth D]18synthetic construct
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 3
MoleculeChains LengthOrganismImage
DNA (5'-D(*CP*CP*TP*CP*GP*TP*AP*AP*AP*AP*CP*TP*GP*TP*CP*AP*AP*C)-3')C [auth E]18synthetic construct
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.10 Å
  • R-Value Free:  0.322 (Depositor), 0.320 (DCC) 
  • R-Value Work:  0.285 (Depositor), 0.300 (DCC) 
  • R-Value Observed: 0.288 (Depositor) 
Space Group: P 21 21 21
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 41.726α = 90
b = 56.332β = 90
c = 115.152γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
PDB_EXTRACTdata extraction
XDSdata reduction
SCALAdata scaling
PHASERphasing

Structure Validation

View Full Validation Report



Entry History 

Deposition Data

Revision History  (Full details and data files)

  • Version 1.0: 2016-11-09
    Type: Initial release
  • Version 1.1: 2024-01-10
    Changes: Data collection, Database references, Refinement description
  • Version 1.2: 2025-04-23
    Changes: Database references, Structure summary