Pseudomonas sp. strain B13 is a sewage isolate capable of utilizing 3-chlorobenzoate (3CBA) as its sole carbon and energy source (
14). The degradation of 3CBA involves an initial oxidation to chlorocatechols, which are subsequently converted to 3-oxoadipate by the action of four enzymes of the modified
ortho cleavage pathway, encoded by the
clcABDE genes (
15). The
clc genes have been transferred from strain B13 to different
Pseudomonas recipient bacteria, thereby enabling the recipients to degrade chlorocatechols as well (
22,
26,
27,
40,
41). We have recently demonstrated that the
clc genes are located on a 105-kb mobile element (named the
clcelement) which is the transfer determinant and is capable of integrating in the chromosome (
25,
34). The original host
Pseudomonas sp. strain B13 also carries two nonadjacent chromosomal copies of the
clc element, although the isolation of small amounts of a 110-kb plasmid (pB13) carrying the
clc genes in strain B13 has been reported elsewhere (
10). The
EcoRI restriction patterns of pB13 and the integrated
clc element were basically identical, and the apparent 5-kb size difference was due only to inaccurate sizing of the largest
EcoRI fragments (
25). This suggested that pB13 and the integrating
clc element exist in two different forms of the same entity, i.e., an integron and a free “plasmid.”
The chromosomal location of the
clc element was demonstrated by Southern hybridization on digested chromosomal DNAs separated by pulsed-field gel electrophoresis for transconjugants of
Pseudomonas putida F1,
P. putida BN10,
Burkholderia cepacia WR401,
Alcaligenes eutrophusCH34, and
Ralstonia spp. (
34). Some transconjugants carried only one chromosomal copy of the
clcelement, others carried two, and the F1 transconjugants carried up to eight copies (
25,
34). Interestingly, chromosomal integrations in the F1 transconjugants occurred in two loci, with tandem amplification mainly in one locus. Integration of the
clc element was shown to be RecA independent and site specific and should therefore have been mediated by functions encoded on the element itself (
25). The integration sites in F1 were both identified as glycine tRNA structural genes, and the integrations appeared to take place at the 3′ end of the tRNA gene. A wide variety of genetic elements are known to integrate into the host chromosome by means of site-specific recombinases which use tRNA genes as their target sites. Such elements include the bacteriophages φR73 (
17,
37), P4 and P22 (
24), and T12 (
21); insertional actinomycete plasmids (
11); virulence determinants of
Dichelobacter nodosus (
11,
12) and of
Vibrio cholerae (
18); and the
Bacteroides NBU1 element (
33).
In this paper, we present the characterization of a novel, unusually long recombinase gene (int-B13) of the phage P4 integrase family and demonstrate its function in site-specific integration of the clc element. To our knowledge, this is the first time that a bacteriophage-related integrase has been shown to be associated with horizontal transfer of genes involved in degradation of aromatic substances, further demonstrating the importance of this class of genetic elements in bacterial evolution.
DISCUSSION
To the right end of the mobile
clc element from
Pseudomonas sp. strain B13, we localized functions involved in site-specific chromosomal integration. An ORF (
int-B13) coding for an integrase of the bacteriophage P4 subfamily started approximately 200 bp from the junction between the element’s right end and the chromosomal target, a glycine tRNA structural gene (Fig.
1). The sequence similarity of the 657-aa product of the
int-B13 gene to P4-related integrases and a demonstration of the gene’s functionality gave evidence that Int-B13 was responsible for site-specific integrative recombination between the
clc element’s attachment site (
attP) and chromosomal attachment sites (
attB sites). Based on these results, we speculate that the
int-B13 gene is also responsible for site-specific chromosomal integration of the complete
clc element.
The
clc element’s integrase showed significant amino acid sequence homology to integrases from bacteriophages like φR73, P4, and Sf6 (
13,
17,
24,
37) (Fig.
3B). A high degree of amino acid sequence homology was also found between Int-B13 and the integrase IntS from the 500-kb symbiosis island of
Mesorhizobium loti(
36) and between Int-B13 and the integrase from the
vap region of
D. nodosus (
11,
12). The majority of these P4-type integrases mediate site-specific integrative recombination involving tRNA structural genes. For instance, retronphage φR73 integrates into a
sel-tRNA gene (
37), satellite phage P4 integrates into a
leu-tRNA (
24), the symbiosis island from
M. loti uses a
phe-tRNA as a target site (
36), and the
vap region from
D. nodosus seems to integrate into a
ser-tRNA gene (
11,
12). Similar to the observations for the
clc element, these integrases mediate insertions into the 3′ ends of their target tRNA genes. Upon integration, the 3′ portion of the tRNA gene at
attB is replaced by an identical segment carried on
attP. For the
clc element, this identity segment had a length of 18 bp. The exact reconstruction of the gene sequence of the target tRNA’s 3′ end is an important feature in maintaining its essential function (
9). Reiter et al. (
28) pointed out that the identity segments of many elements inserting into tRNA genes extend from the anticodon loop through the 3′ end. However, the identity segments of the
att sites from the
clc element (18 bp) (Fig.
1B), phage P4 (20 bp), the
vap region of
D. nodosus (19 bp), and the
M. loti symbiosis island (17 bp) are shorter, extending from the TψC loop through the 3′ end. Regions of dyad symmetry characteristic of tRNA genes are supposed to serve as integrase binding sites (
28). However, this can be true only for
attB or
attL DNA containing the complete tRNA gene and not for the corresponding
attP site which contains only the 3′ portion of the sequence.
The complete functional Int-B13 protein had a considerably higher molecular mass than other known P4-type integrases. Even so, several smaller ORFs were found in frame with the largest coding region (Fig.
1B). Although not investigated, the translational start of Int-B13 may be ATG at nt 304 rather than that at nt 262, due to a better ribosome binding site. Other downstream translational starts may result in truncated forms of Int-B13 lacking part of the N-terminal domain. For the conjugative transposon Tn
916, truncated integrase proteins are thought to be involved in regulating recombinational activity (
32) by interacting with the full-length integrase protein or the attachment sites. Since Int-B13 is so much longer than other P4-related integrases and the C-terminal region is not homologous to the site-specific recombinases, the ORF starting at Val-453 could have a different role. Perhaps this part codes for the excionase function, which is typically clustered together with an integrase. The excionase stimulates excisive recombination and is usually a small protein of 60 to 120 aa (
5,
30). The C-terminal region of Int-B13 did not show homology with known excionases, but this type of protein normally has very little homology (
7,
30). The only sequence with significant homology to the C-terminal domain of Int-B13 was an ORF originating in
P. aeruginosa PaK1 (
38). The ORF is located upstream of an NAH7-like
pah gene cluster, putatively encoding naphthalene degradation. Interestingly, a translation of the (published) nucleotide sequence upstream of the ORF also revealed the RRMMQDWADRLDL residue motif, which forms the last conserved region in the C termini of the P4-related integrases (Fig.
3). Therefore, we suspect that the proposed ORF represents the C-terminal domain of a larger ORF, similar to Int-B13. No function has yet been assigned to the ORF flanking the
pah cluster, but the entire gene cluster is thought to be part of a mobile element (
39).
The previously isolated and characterized plasmid pB13 carrying the
clc genes in
Pseudomonas sp. strain B13 (
10) seems to be identical to our integrated
clcelement (
25). The fact that other research groups have been unable to isolate plasmid DNA from strain B13 (
22,
41) was probably due to the
clc element’s integration into the chromosome. Our observations at the moment indicate that the extrachromosomal circular form of the element is abundant only in strain B13 in stationary phase during growth on 3CBA (results not shown). Interestingly, in most other transconjugants analyzed so far, none or very little of the circular form can be detected by PCR amplification (results not shown). How excision and transfer of the
clc element are regulated will therefore have to be studied more extensively.
It is becoming clear that a new form of mobile genetic elements exists, which we propose to call “gene islands” (after the use of the terms pathogenicity island and symbiosis island). Such elements harbor integrases related to those of bacteriophages and may have been evolutionarily derived from those. Examples include insertional plasmids from actinomycetes (
5-8), some of the pathogenicity islands from gram-negative pathogens (
11,
12,
18), and a recently discovered 500-kb transferable region (symbiosis island) from
M. loti (
36). This symbiosis island seems to carry all the genetic information required for nodule formation, symbiotic nitrogen fixation, and synthesis of three vitamins. For the first time, our work demonstrated that a bacteriophage-related integrase was associated with horizontal transfer of genes coding for degradation of xenobiotics, and the
clcelement could therefore be considered a “degradation island.” Apparently, bacteriophage-related integrases using tRNA structural genes as (chromosomal) insertion sites are involved in horizontal transfer of very diverse genetic determinants, not only those of bacterial virulence (
11). This class of integrating elements may have been underestimated and could have greater evolutionary importance than previously thought.