MOLECULAR BIOLOGY
OF HUMAN PAPILLOMAVIRUSES
1. Human papillomaviruses (HPVs) are a family
of small DNA viruses consisting of over 150 related genetic types, of which
more than 85 are fully sequenced. The
genome is encapsidated in an icosahedral protein coat, and there is no membrane
envelope. The HPVs are rather closely
related through phylogenetic and biological properties to the animal
papillomaviruses, which are host-specific to other vertebrates including
amphibians, reptiles, birds and a variety of land and sea mammals. HPVs do not infect any other species, nor do
any of the animal papillomaviruses infect humans.
2. The HPVs target either cutaneous or mucosal
epithelium, each type at a preferred body site, and cause warts. Plantar, common and flat warts of the
cutaneous, external epithelia are usually associated with HPV genotypes 1, 2, 3
or 4. HPV -5, -8 and related types
cause the rare skin disease epidermodysplasia
verruciformis, from which 40% of the patients progress to cancer. HPV types 6 and 11 cause benign ano-genital
condylomas, upper aerodigestive tract papillomas (most notably laryngeal
papillomas), and conjunctival and ear canal papillomas. Infections with HPV-16 and HPV-18 and
numerous related genotypes often occur in or near the squamo-columnar junctions
of the cervix, penis, anus, or aerodigestive tracts. These lesions can progress from benign condylomata to dysplasias
and occasionally to carcinomas. New
papillomavirus-associated cancers arise in about 600,000 patients in the world
each year. When caught early in the
disease process, they can be treated surgically. There are but few effective antiviral drugs, and prophylactic and
therapeutic vaccines are only in the initial stages of development and testing.
3. The viral genome consists of a circular
double-stranded DNA approximately 7,900 base pairs long (range 7400-8200). Most viral genotypes have a very similar
genetic, transcriptional and functional
organization consisting of a transcription and replication control
region, an early (E) region encoding proteins for replication, regulation and
modification of the host cytoplasm and nucleus, and a late (L) region encoding
capsid proteins.
4. The upstream regulatory region (URR), also
known as the long control region (LCR), contains the origin of DNA replication,
one or two promoters, transcriptional enhancer and repressor protein binding
sites, and the late RNA poly-adenylation site.
It is subject to very complex interactions with both viral and host
regulatory proteins.
5. All genes are transcribed from the same DNA
strand. There are several promoters
active at different stages of infection, and separate early and late
polyadenylation sites are used.
Transcripts are processed using a large number of alternative splice
donor and acceptor site combinations, allowing them to encode sets of related
proteins that share some common sequences and perform related functions.
6. The viral DNA persists as nuclear,
extra-chromosomal plasmids at moderate copy numbers (eg. 20 episomes per cell)
in the cycling basal and parabasal keratinocytes. Infections generally remain subclinical. Productive infections that can culminate in
the release of progeny virions require terminal differentiation of stratified
epithelium to enable the activation of differentiation-dependent viral
promoters. Reactivation of a latent
infection leads to a programmed sequence of high level expression of the viral
genes, vegetative DNA replication and virion morphogenesis in the superficial
strata of the epithelium.
7. The E1 and E2 genes, at least, are expressed
in basal and parabasal cells. Families of alternative E1 and E2 proteins serve
as regulatory factors that either repress or activate transcription. They also comprise the viral replication
origin-binding proteins that recruit the host replication proteins to the
origin. The host heat shock proteins
hsp70 and hsp40 facilitate the assembly of the pre-replication complex,
particularly the formation of a di-hexamers of E1 joined by hsp40 into two
rings around the DNA. These E1:hsp40
complexes remain associated with the replication elongation complex as an
ATP-dependent DNA helicase, and E1 protein is the only enzyme encoded by the
papillomaviruses. Host DNA
polymerase is recruited by E1, and the
single-stranded DNA binding protein RPA by E2 protein. E1, E2, pol subunits p180 and p70, and other proteins in the preinitiation
complex are phosphorylated by cyclin E/cdk2 complexes to trigger the elongation
phase of replication, thus providing the biochemical link to cell cycle
control.
The
E5 trans-membrane protein is found in the plasma membrane as well as in the
Golgi and endoplasmic reticulum membranes.
It modulates EGF receptor activity and other protein trafficking. The E4 protein directly or indirectly
interacts with cytokeratin filaments (for unknown reasons).
E6
and E7 genes are normally expressed in the differentiated spinous strata of the
epithelium. E7 protein is essential to
establish cellular conditions under which the viral DNA can replicate in the
non-cycling, differentiated keratinocytes of squamous epithelia, through the
following mechanism: The host
retinoblastoma susceptibility (tumor suppressor) protein is a cell cycle
regulatory protein which recruits cellular histone deacetylase to particular
promoters controlled by the E2F:DP family of transcription factors. Promoters associated with pRB - E2F:DP complexes are thereby repressed due
to chromatin condensation. The
papillomaviral E7 protein dissociates pRB from E2F, leading to the local
decondensation of the chromatin and induction of a large suite of host genes
needed for deoxyribonucleotide synthesis and DNA replication. E7 also interacts with pRB-related proteins
including p107 and p130. Moreover, it
activates an additional pathway for induction of proliferating cell nuclear
antigen (PCNA), the processivity factor for host DNA polymerase- . The cells move into S phase, and both
cellular and viral DNA replicate.
However, such reactivated, differentiated cells do not generally
progress to mitosis.
E6
protein binds to and inhibits the host cell p53 tumor suppressor protein. Among other consequences, this delays both
programmed and defensive apoptosis of the keratinocytes, most likely to allow
more time for viral reproduction.
E7-mediated progression of the differentiated cells into S phase also
triggers the translation of pre-existing host mRNA encoding the p21cip1 inhibitor
of cyclin-dependent kinases and of PCNA.
p21cip1 binds to the cyclin D/cdk2 and cyclinE/cdk4 complexes, and the cell cycle transition
from G1 phase into S phase is blocked.
p21cip1 also can bind to PCNA, and those DNA replication forks already
engaged in synthesis are blocked from further elongation. However, such unscheduled DNA replication is
successfully inhibited some but not all of the infected cells. Thus some cells accumulate inhibitory
amounts of inactive p21cip1 / cyclin E / PCNA complexes, while others fail to
accumulate p21cip1, and the active PCNA is able to support de novo synthesis of
viral and host DNA. Apparently the
timing of cyclin E induction and p21cip1 induction relative to the availability
of other host replication factors helps determine the fate of individual,
infected cells. As a consequence,
natural and experimental papillomas are a mosaic of productive and inhibited
cells.
L1
is the major capsid protein and L2 is the minor capsid protein. E2 protein most likely participates in
selecting viral DNA for packaging into progeny virions by associating with the
E2 binding motifs in the replication origin and also complexing with the L2
protein..
8. Wound healing of HPV-infected epithelium can
result in temporary up-regulation of the E6-E7 genes in basal-like cells. In addition, mutations of the host cell or
virus that result in over-expression of the viral E6 and E7 genes in basal and
parabasal stem/reserve cells may trigger excessive cell cycling. Thus, if the viral DNA integrates into a
host cell chromosome with local insertional damage, with the deletion of the
viral E2 regulatory gene, and/or with up-regulatory effects of the flanking
host sequences on the E6-E7 promoter, the infected cells may become able to
proliferate and accumulate secondary mutations that eventually can lead to
carcinomas. In overview, the different
patterns of regulation of E6 and E7 expression are very complex and can account
for apparently contradictory observations in natural and experimental settings.
9. A healthy cell-mediated immune system down-modulates HPV activity by mechanisms not yet known, while transient or long-term immunosuppression can lead to reactivation of latent infections, onset of epithelial hyperproliferation and pathologic changes, and virus reproduction.