Download citation
Download citation
link to html
The title mol­ecule, C15H23N3O5S, was prepared as a synthetic precursor to 4-methyl­thia­zole-based DNA minor groove binders which would bear chiral amino acids in the sequence. The crystallographic evidence presented herein shows that the aromatic amide NH group preceding the thia­zole ring points away from the direction of sulfur. The mol­ecule is biplanar, with the dihedral angle between the N-terminus peptide moiety and the thia­zole-containing plane being 49.7 (5)°, with a bend at the C[alpha] carbon.

Supporting information

cif

Crystallographic Information File (CIF) https://doi.org/10.1107/S0108270100012439/vj1112sup1.cif
Contains datablocks global, I

hkl

Structure factor file (CIF format) https://doi.org/10.1107/S0108270100012439/vj1112Isup2.hkl
Contains datablock I

CCDC reference: 156172

Comment top

One approach to the general problem of developing DNA sequence-specific reagents is by modifying the lead compounds based on the principles of DNA recognition. Distamycin (Dst) and Netropsin (Nt) are DNA binding compounds that bind in the minor groove of AT rich sequences of DNA. The determinants in the DNA recognition of these molecules were deciphered from the crystal structures of free Nt (Berman et al., 1979) and also that of the complexes of Nt (Kopka et al., 1985) and Dst (Coli et al., 1987) with the respective target DNA sequences. The sequence-specificity in the binding of these molecules is a result of the hydrogen-bonding interactions between the amide NH groups and the adenine N3 or thymine O2 atoms, and the close van der Waals contact between the C3(H) of the pyrrole residues and thymine –CH3 groups. Replacement of pyrrole by other heterocyclic moieties capable of specific DNA recognition by hydrogen-bond acceptance, such as imidazole, pyridine or oxazole led to the generation of molecules that recognize GC rich sequences of DNA and are termed `lexitropsins' or information reading molecules (for recent reviews see: Wemmer & Dervan, 1997; Baily & Chaires, 1998).

Thiazole lexitropsins with amino and carboxy substituents at 2- and 5- positions, respectively, along with alkyl substitution at position 4- (type II thiazole lexitropsins) are expected to be more discriminating than the parent antibiotics Dst and Nt against GC tolerance (Rao et al., 1990), as this substitution pattern favours orientation of S towards the minor groove of DNA.

Yet another important component of molecular recognition is ligand chirality and one way of modulating the sequence specificity and affinity of prototype lexitropsins is by intoducing chiral amino acids (Herman et al., 1998). As a part of our continuing effort in the development of DNA binding molecules (Bhattacharya & Thomas, 1998, 2000a, 2000b) we synthesized the title compound, (I), a dipeptide precursor in the synthesis of type II thiazole containing lexitropsins. We attempted to solve the crystal structure of this molecule to determine (a) the relative orientation of the amide unit preceding the thiazole ring with respect to the S atom, (b) the overall shape of the molecule. \sch

The title molecule (see Fig. 1) was found to be biplanar with the dihedral angle between the mean plane (A) containing N terminus peptide moeity, (C3—C4—O1—C0'-N1—C1A) and the mean plane (B) containing the thiazole ring, (C1'-N2—C5—N3—C6—C7—S1—C8'-O9—C10) being 49.7 (5)° with a bend at the Cα carbon·The maximum deviation of the atoms defining the two planes is 0.100 (4) and 0.040 (4) Å for planes (A) and (B), respectively. The thiazole ring is planar, with a maximum deviation of the ring atoms from the plane being 0.005 (4) Å. Within the ring, the bonding angle at S1, (C5—S1—C7) is 88.36 (15)°, which is unusually small. The length of the C—S bonds is also on the lower side with average of 1.715 (4) Å. The carbonyl groups that are on either side of the thiazole ring lie almost in the plane of the ring [deviations 0.063 (4) and 0.039 (4) Å, respectively, for O1' and O8'], and are pointing in the same direction as sulfur. The non-bonded distances of O1' and O8' from S1 are 2.737 (4) and 2.939 (4) Å, respectively, which are significantly shorter than the sum of van der Waals radii of S and O (3.50 Å) (Bondi, 1964).

It is interesting to note that the aromatic amide N—H preceeding thiazole ring points in the direction opposite to that of the sulfur atom. As mentioned earlier, minor groove binding of type II thiazole containing lexitropsins would require the preceeding amide N—H to be pointed in the same direction as that of the sulfur atom. It is worthwhile to note at this point that in the NMR structure of a thiazole/pyrrole lexitropsin complexed with a dodecanucleotide (Kumar et al., 1990), the N-terminus thiazole unit was found to be binding in intercalative fashion rather than minor groove binding. This unusual observation could be explained in the light of the present structure. In the absence of the preceding amide –NH pointing towards the minor groove, the planar aromatic ring of thiazole containing lexitropsins could act as an intercalating chromophore, as observed in the case mentioned above. Thus the results of the observations emerging from the present structure should help the future design of thiazole based DNA binding compounds.

Experimental top

The synthesis of the title compound was carried out by coupling of the t-Boc -alanine with ethyl 4-methyl-2-aminothiazole-5-carboxylate in the presence of dicyclohexyl carbodiimide and hydroxybenzotriazole. The pure compound was isolated in 96% yield upon coloumn chromatography over silica gel using 20–30% petrolium ether/ethyl acetate as eluent. The diffraction quality crystals were obtained by layering acetone solution of (I) (300 µl, 50 mM) over 5 ml of water in a test tube. 1H NMR: 200 MHz (CDCl3, δ p.p.m.), 1.36 (t, 3H), 1.45 (s, 9H), 1.5 (3H merging with the Boc peak), 2.57 (s, 3H), 4.29 (q, 2H), 4.55 (bs, 1H), 5.47 (bs, 1H), 10.64 (bs, 1H). IR: (Nujol, ν, cm−1) 3269, 1707, 1667, 1527.

Refinement top

Molecule (I) crystallized in the monoclinic system; space group C2, Cm or C2/m from systematic absences; C2 was chosen and confirmed by the analysis. H atoms were treated as riding atoms (C—H3 0.96, C—H2 0.97,C—H 0.98, N—H 0.86). The distance between C10 and C11 was fixed at 1.500 (3) Å to avoid the shortening due to thermal vibration.

Computing details top

Data collection: CAD-4 Software (Enraf-Nonius, 1989); cell refinement: CAD-4 Software; data reduction: MolEN (Fair, 1990); program(s) used to solve structure: SHELXS97 (Sheldrick, 1990); program(s) used to refine structure: SHELXL97 (Sheldrick, 1997); molecular graphics: PLATON (Spek, 1990) and ORTEP-3 (Farrugia, 1997); software used to prepare material for publication: PLATON and SHELXL97.

Figures top
[Figure 1] Fig. 1. View of the title compound with the atom-numbering scheme. Displacement ellipsoids for non-H atoms are drawn at the 50% probability level.
Ethyl 2-[N-(tert-butyloxycarbonyl)-L-alanylamino]-4-methyl-1,3-thiazole-5 -carboxylate top
Crystal data top
C15H23N3O5SF(000) = 760
Mr = 357.42Dx = 1.265 Mg m3
Monoclinic, C2Cu Kα radiation, λ = 1.54180 Å
a = 19.2711 (10) ÅCell parameters from 25 reflections
b = 9.769 (3) Åθ = 10–35°
c = 10.2592 (10) ŵ = 1.78 mm1
β = 103.666 (13)°T = 293 K
V = 1876.8 (6) Å3Plate, colourless
Z = 40.13 × 0.06 × 0.04 mm
Data collection top
ENRAF-NONIUS CAD-4
diffractometer
1756 reflections with I > 2σ(I)
Radiation source: fine-focus sealed tubeRint = 0.021
Graphite monochromatorθmax = 69.9°, θmin = 4.4°
ω/2θ scansh = 023
Absorption correction: ψ scan
(North et al., 1968)
k = 011
Tmin = 0.738, Tmax = 0.931l = 1212
1944 measured reflections2 standard reflections every 60 min
1886 independent reflections intensity decay: <2.0%
Refinement top
Refinement on F2Hydrogen site location: inferred from neighbouring sites
Least-squares matrix: fullH-atom parameters constrained
R[F2 > 2σ(F2)] = 0.038 w = 1/[σ2(Fo2) + (0.074P)2 + 0.4897P]
where P = (Fo2 + 2Fc2)/3
wR(F2) = 0.109(Δ/σ)max = 0.018
S = 1.04Δρmax = 0.34 e Å3
1886 reflectionsΔρmin = 0.18 e Å3
207 parametersExtinction correction: SHELXL, Fc*=kFc[1+0.001xFc2λ3/sin(2θ)]-1/4
2 restraintsExtinction coefficient: 0.0027 (3)
Primary atom site location: structure-invariant direct methodsAbsolute structure: Flack,(1983)
Secondary atom site location: difference Fourier mapAbsolute structure parameter: 0.03 (2)
Crystal data top
C15H23N3O5SV = 1876.8 (6) Å3
Mr = 357.42Z = 4
Monoclinic, C2Cu Kα radiation
a = 19.2711 (10) ŵ = 1.78 mm1
b = 9.769 (3) ÅT = 293 K
c = 10.2592 (10) Å0.13 × 0.06 × 0.04 mm
β = 103.666 (13)°
Data collection top
ENRAF-NONIUS CAD-4
diffractometer
1756 reflections with I > 2σ(I)
Absorption correction: ψ scan
(North et al., 1968)
Rint = 0.021
Tmin = 0.738, Tmax = 0.9312 standard reflections every 60 min
1944 measured reflections intensity decay: <2.0%
1886 independent reflections
Refinement top
R[F2 > 2σ(F2)] = 0.038H-atom parameters constrained
wR(F2) = 0.109Δρmax = 0.34 e Å3
S = 1.04Δρmin = 0.18 e Å3
1886 reflectionsAbsolute structure: Flack,(1983)
207 parametersAbsolute structure parameter: 0.03 (2)
2 restraints
Special details top

Geometry. All e.s.d.'s (except the e.s.d. in the dihedral angle between two l.s. planes) are estimated using the full covariance matrix. The cell e.s.d.'s are taken into account individually in the estimation of e.s.d.'s in distances, angles and torsion angles; correlations between e.s.d.'s in cell parameters are only used when they are defined by crystal symmetry. An approximate (isotropic) treatment of cell e.s.d.'s is used for estimating e.s.d.'s involving l.s. planes.

Refinement. Refinement of F2 against ALL reflections. The weighted R-factor wR and goodness of fit S are based on F2, conventional R-factors R are based on F, with F set to zero for negative F2. The threshold expression of F2 > σ(F2) is used only for calculating R-factors(gt) etc. and is not relevant to the choice of reflections for refinement. R-factors based on F2 are statistically about twice as large as those based on F, and R- factors based on ALL data will be even larger.

Fractional atomic coordinates and isotropic or equivalent isotropic displacement parameters (Å2) top
xyzUiso*/Ueq
C10.2714 (3)0.8387 (14)0.1010 (5)0.160 (4)
H1A0.26620.74960.06500.240*
H1B0.27520.83000.19220.240*
H1C0.23050.89370.09800.240*
C20.4031 (3)0.8156 (7)0.0015 (5)0.1119 (19)
H2A0.44240.85690.06440.168*
H2B0.41570.80300.08290.168*
H2C0.39240.72840.03530.168*
C30.35335 (17)1.04047 (13)0.07835 (15)0.119 (2)
H3A0.31111.09600.09560.179*
H3B0.36701.02350.16100.179*
H3C0.39131.08740.01730.179*
C40.33876 (17)0.90706 (13)0.01775 (15)0.0731 (10)
O10.32967 (6)0.94856 (12)0.11453 (11)0.0680 (6)
C0'0.31757 (7)0.85388 (13)0.20182 (11)0.0585 (7)
O0'0.30309 (6)0.73517 (12)0.17723 (12)0.0901 (9)
N10.32382 (7)0.91199 (13)0.32228 (11)0.0641 (7)
H10.33260.99820.33200.077*
C1A0.31590 (7)0.83149 (13)0.43640 (11)0.0581 (7)
H1AA0.30790.73560.40890.070*
C1B0.3822 (2)0.8409 (6)0.5504 (4)0.0861 (12)
H1B10.42240.80420.52160.129*
H1B20.37500.78940.62580.129*
H1B30.39130.93490.57600.129*
C1'0.25284 (19)0.8821 (3)0.4848 (3)0.0509 (7)
O1'0.23896 (15)1.0042 (2)0.4951 (3)0.0661 (6)
N20.21172 (13)0.7829 (3)0.5221 (2)0.0511 (5)
H20.22280.69910.51130.061*
C50.15358 (15)0.8067 (3)0.5757 (3)0.0475 (6)
N30.11963 (14)0.7040 (3)0.6120 (3)0.0542 (6)
C60.06421 (17)0.7486 (4)0.6642 (3)0.0559 (7)
C6'0.0186 (2)0.6434 (4)0.7081 (5)0.0801 (11)
H6'10.03930.55450.70510.120*
H6'20.02830.64520.64940.120*
H6'30.01520.66280.79820.120*
C70.05840 (17)0.8894 (3)0.6674 (3)0.0551 (7)
S10.12288 (4)0.96626 (6)0.60170 (8)0.0570 (2)
C8'0.00871 (18)0.9806 (5)0.7128 (4)0.0671 (8)
O8'0.01048 (17)1.1029 (3)0.7036 (4)0.0948 (10)
O90.03728 (16)0.9129 (3)0.7656 (3)0.0872 (8)
C100.0873 (3)1.0027 (6)0.8134 (6)0.1105 (17)
H10A0.12141.04320.73850.133*
H10B0.06181.07540.86920.133*
C110.1240 (4)0.9120 (7)0.8925 (9)0.157 (3)
H11A0.14860.84040.83580.235*
H11B0.15770.96450.92710.235*
H11C0.08930.87240.96570.235*
Atomic displacement parameters (Å2) top
U11U22U33U12U13U23
C10.117 (4)0.284 (12)0.070 (3)0.029 (6)0.002 (2)0.020 (5)
C20.143 (4)0.108 (4)0.095 (3)0.058 (4)0.047 (3)0.002 (3)
C30.175 (5)0.110 (4)0.093 (3)0.041 (4)0.069 (4)0.034 (3)
C40.082 (2)0.083 (3)0.0574 (17)0.019 (2)0.0235 (15)0.0046 (17)
O10.0926 (14)0.0556 (14)0.0639 (11)0.0128 (12)0.0345 (10)0.0073 (11)
C0'0.0647 (16)0.0499 (19)0.0630 (16)0.0007 (14)0.0194 (13)0.0012 (14)
O0'0.132 (2)0.0593 (17)0.0757 (16)0.0286 (16)0.0191 (16)0.0123 (14)
N10.101 (2)0.0345 (13)0.0694 (15)0.0052 (13)0.0466 (14)0.0001 (11)
C1A0.0784 (18)0.0328 (13)0.0717 (17)0.0004 (14)0.0345 (14)0.0066 (14)
C1B0.076 (2)0.089 (3)0.095 (3)0.014 (2)0.0232 (18)0.025 (3)
C1'0.0704 (18)0.0304 (15)0.0561 (15)0.0016 (11)0.0236 (13)0.0004 (11)
O1'0.0869 (15)0.0273 (12)0.0958 (15)0.0012 (9)0.0450 (13)0.0027 (10)
N20.0693 (14)0.0275 (10)0.0622 (13)0.0019 (10)0.0270 (11)0.0000 (10)
C50.0631 (15)0.0298 (13)0.0502 (12)0.0023 (11)0.0149 (11)0.0006 (10)
N30.0690 (15)0.0308 (12)0.0674 (14)0.0022 (10)0.0251 (12)0.0010 (10)
C60.0595 (16)0.0424 (15)0.0680 (17)0.0007 (13)0.0194 (13)0.0026 (13)
C6'0.086 (2)0.055 (2)0.112 (3)0.0030 (19)0.047 (2)0.010 (2)
C70.0591 (17)0.0458 (17)0.0608 (16)0.0043 (13)0.0147 (13)0.0011 (14)
S10.0666 (4)0.0318 (4)0.0743 (5)0.0025 (3)0.0201 (3)0.0004 (3)
C8'0.0628 (17)0.056 (2)0.082 (2)0.0035 (17)0.0156 (15)0.0074 (18)
O8'0.0875 (18)0.0509 (16)0.149 (3)0.0145 (13)0.0331 (18)0.0126 (17)
O90.0866 (17)0.0715 (19)0.117 (2)0.0099 (14)0.0518 (15)0.0043 (16)
C100.106 (3)0.096 (4)0.148 (4)0.019 (3)0.066 (3)0.015 (3)
C110.150 (6)0.113 (6)0.252 (9)0.004 (4)0.135 (7)0.005 (6)
Geometric parameters (Å, º) top
C1—C41.529 (7)N2—C51.380 (4)
C2—C41.504 (5)C5—N31.299 (4)
C3—C41.499 (2)C5—S11.710 (3)
C4—O11.466 (2)N3—C61.374 (4)
O1—C0'1.346 (2)C6—C71.381 (5)
C0'—O0'1.206 (2)C6—C6'1.490 (5)
C0'—N11.339 (2)C7—C8'1.463 (5)
N1—C1A1.448 (2)C7—S11.720 (3)
C1A—C1'1.501 (4)C8'—O8'1.199 (6)
C1A—C1B1.518 (4)C8'—O91.320 (5)
C1'—O1'1.233 (4)O9—C101.470 (5)
C1'—N21.362 (4)C10—C111.489 (3)
O1—C4—C2108.5 (2)C1'—N2—C5125.0 (3)
O1—C4—C3102.6 (1)N3—C5—N2119.7 (3)
C2—C4—C3110.0 (3)N3—C5—S1116.3 (2)
O1—C4—C1111.5 (3)N2—C5—S1124.0 (2)
C2—C4—C1112.3 (5)C5—N3—C6110.9 (3)
C3—C4—C1111.5 (4)N3—C6—C7113.6 (3)
C0'—O1—C4120.2 (1)N3—C6—C6'117.9 (3)
O0'—C0'—N1124.5 (1)C7—C6—C6'128.5 (3)
O0'—C0'—O1126.2 (1)C6—C7—C8'132.6 (3)
N1—C0'—O1109.3 (1)C6—C7—S1110.8 (3)
C0'—N1—C1A120.8 (1)C8'—C7—S1116.5 (3)
N1—C1A—C1'109.71 (12)C5—S1—C788.36 (15)
N1—C1A—C1B111.09 (18)O8'—C8'—O9124.4 (4)
C1'—C1A—C1B109.0 (2)O8'—C8'—C7123.3 (4)
O1'—C1'—N2120.8 (3)O9—C8'—C7112.3 (4)
O1'—C1'—C1A123.8 (3)C8'—O9—C10113.2 (4)
N2—C1'—C1A115.4 (2)O9—C10—C11105.1 (4)
C2—C4—O1—C0'61.7 (3)S1—C5—N3—C60.5 (3)
C3—C4—O1—C0'178.1 (1)C5—N3—C6—C70.8 (4)
C1—C4—O1—C0'62.5 (5)C5—N3—C6—C6'178.7 (3)
C4—O1—C0'—O0'11.18 (15)N3—C6—C7—C8'179.8 (3)
C4—O1—C0'—N1169.05 (15)C6'—C6—C7—C8'0.4 (7)
O0'—C0'—N1—C1A2.4 (2)N3—C6—C7—S10.8 (4)
O1—C0'—N1—C1A177.8 (1)C6'—C6—C7—S1178.6 (3)
C0'—N1—C1A—C1'117.15 (14)N3—C5—S1—C70.0 (2)
C0'—N1—C1A—C1B122.3 (2)N2—C5—S1—C7178.8 (3)
N1—C1A—C1'—O1'42.4 (4)C6—C7—S1—C50.4 (3)
C1B—C1A—C1'—O1'79.4 (4)C8'—C7—S1—C5179.6 (3)
N1—C1A—C1'—N2139.6 (2)C6—C7—C8'—O8'177.4 (4)
C1B—C1A—C1'—N298.6 (3)S1—C7—C8'—O8'1.5 (5)
O1'—C1'—N2—C51.4 (5)C6—C7—C8'—O93.1 (6)
C1A—C1'—N2—C5176.6 (2)S1—C7—C8'—O9177.9 (2)
C1'—N2—C5—N3177.6 (3)O8'—C8'—O9—C100.1 (6)
C1'—N2—C5—S11.1 (4)C7—C8'—O9—C10179.3 (4)
N2—C5—N3—C6179.3 (2)C8'—O9—C10—C11169.0 (5)
Hydrogen-bond geometry (Å, º) top
D—H···AD—HH···AD···AD—H···A
N1—H1···N3i0.862.233.071 (3)166
N2—H2···O1ii0.862.052.903 (4)173
C3—H3A···O0iii0.962.553.506 (3)172
Symmetry codes: (i) x+1/2, y+1/2, z+1; (ii) x+1/2, y1/2, z+1; (iii) x+1/2, y+1/2, z.

Experimental details

Crystal data
Chemical formulaC15H23N3O5S
Mr357.42
Crystal system, space groupMonoclinic, C2
Temperature (K)293
a, b, c (Å)19.2711 (10), 9.769 (3), 10.2592 (10)
β (°) 103.666 (13)
V3)1876.8 (6)
Z4
Radiation typeCu Kα
µ (mm1)1.78
Crystal size (mm)0.13 × 0.06 × 0.04
Data collection
DiffractometerENRAF-NONIUS CAD-4
diffractometer
Absorption correctionψ scan
(North et al., 1968)
Tmin, Tmax0.738, 0.931
No. of measured, independent and
observed [I > 2σ(I)] reflections
1944, 1886, 1756
Rint0.021
(sin θ/λ)max1)0.609
Refinement
R[F2 > 2σ(F2)], wR(F2), S 0.038, 0.109, 1.04
No. of reflections1886
No. of parameters207
No. of restraints2
H-atom treatmentH-atom parameters constrained
Δρmax, Δρmin (e Å3)0.34, 0.18
Absolute structureFlack,(1983)
Absolute structure parameter0.03 (2)

Computer programs: CAD-4 Software (Enraf-Nonius, 1989), CAD-4 Software, MolEN (Fair, 1990), SHELXS97 (Sheldrick, 1990), SHELXL97 (Sheldrick, 1997), PLATON (Spek, 1990) and ORTEP-3 (Farrugia, 1997), PLATON and SHELXL97.

Selected geometric parameters (Å, º) top
C3—C41.499 (2)C5—N31.299 (4)
C4—O11.466 (2)C5—S11.710 (3)
O1—C0'1.346 (2)N3—C61.374 (4)
C0'—O0'1.206 (2)C6—C71.381 (5)
C0'—N11.339 (2)C7—C8'1.463 (5)
N1—C1A1.448 (2)C7—S11.720 (3)
C1'—O1'1.233 (4)C8'—O8'1.199 (6)
C1'—N21.362 (4)C8'—O91.320 (5)
N2—C51.380 (4)
C0'—O1—C4120.2 (1)C5—S1—C788.36 (15)
C7—C6—C6'128.5 (3)C8'—O9—C10113.2 (4)
C3—C4—O1—C0'178.1 (1)C1'—N2—C5—S11.1 (4)
C4—O1—C0'—N1169.05 (15)C6—C7—C8'—O8'177.4 (4)
O1—C0'—N1—C1A177.8 (1)S1—C7—C8'—O8'1.5 (5)
C0'—N1—C1A—C1'117.15 (14)C7—C8'—O9—C10179.3 (4)
C1A—C1'—N2—C5176.6 (2)C8'—O9—C10—C11169.0 (5)
C1'—N2—C5—N3177.6 (3)
 

Follow Acta Cryst. C
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds