Abstract: At present, a large number of compounds of known structure lack basic physical property data and thermodynamic data. In order to promote data integrity and usability in chemistry database, the study calculates the structure of about 200,000 compounds using Gaussian03 software programs based on compounds structure data and compounds profiles. Meanwhile, the study performs geometric structure optimization, thermodynamic data analysis, and spectrum analysis. Then, compound species, together with their thermodynamic parameters and spectrum data, are extracted from quantum calculation results according to international standards, including infrared absorption spectrum, dipole moment, absolute polarizability, turning temperature and constant, zero-point vibrational energy, zero-point correction, molecular internal energy, molecular kinetic energy, enthalpy, free energy, heat capacity at constant volume, and entropy. A second processing (i.e., data analysis, mining and duplicate checking) results in quantum chemical calculation data of 18000 compounds (including spectrum data of 5321 compounds). To improve data reusability, all the resulted data have been standardized and processed for easier reuse.
Keywords: compounds structure; quantum chemical calculation; data analysis and processing; thermodynamic data; spectrum data