In order to build an "advanced" compound-sentence corpus for Chinese Information Process,automatic word segmentation and POS tagging work should be completed first of all.Then on this basis,automatic classification and labeling of levels and relationship between clauses should be conducted.As punctuation marks are the most intuitive and clear marks,we programmed the computer to regard the language fragments between punctuation as clauses.Doing so much is risking,because it will "victimize" a lot of non-clause language fragments...