Zakaria Suliman Zubi Sirt University, Faculty of Science, Computer Science Department Sirte, P.O Box 727, Libya, {zszubi@yahoo.com} Abstract: The internet is a huge source of documents, containing a massive number of texts in multilingual languages on a wide range of topics. These texts are demonstrating in an electronic documents format hosted on the web. The documents exchanged using special forms in an Electronic Data Interchange (EDI) environment. Using web text mining approaches to mine documents in EDI environment could be new challenging guidelines in web text mining. Applying text1mining approaches to discover knowledge previously unknown patters retrieved from the web documents by using partitioned cluster analysis methods such as k1 means methods using Euclidean distance measure algorithm for EDI text document datasets is unique area of research these days. Our experiments utilize the standard K1means algorithm on EDI text documents dataset that most commonly used in electronic interchange and we report some results using text mining clustering application solution called WEKA. This study will provide high quality services to any organization that is willing to use the system. KeyWords: Electronic Data Interchange (EDI), Web Mining, Text Mining, Clustering, K1 mean algorithm, Similarity Measures, Partitioned Cluster Analysis. The growth of the stored electronic documents is increasing day by day on the web. These documents contain an electronic media for a particular end user represented in texts, pictures, audios and videos format. The electronic texts in these documents characterized in multilingual languages and classified into two catalogs such as Latin and non1 Latin languages. These languages correspond to the electronic text contents of the documents stored on the web. The text contents became the most important item in the document and the most frequently distributed in that document as well. Electronic documents on the internet had a tremendous number of electronic texts defined in million of topics. Internet users actively are exchanging documents with each other asking about subjects of interest or sending requests to Web1 based expert forums, or any other services in electronic text forms. Organizations such as governments and companies institutions are exchanging electronics documents in different form throughout the internet media in a security environment called electronic documents interchange (EDI). The electronic documents interchange (EDI) is a computer1to1computer exchange environment of electronic documents between organizations. EDI replaces the faxing and mailing of paper documents. EDI documents uses a precise computer record formats based on a commonly accepted standards. However, each organization will use the flexibility allowed by the standards in a unique way that fits their daily inquires needs. The data in the EDI documents mainly represented in text formats translated from one host to another through a network media. EDI usually transfers text data between different originations using internet or network environments. Such environment could be a VANs or the Internet. As more and more organizations connected to the Internet, EDI is becoming progressively more significant as an easy mechanism for organizations to manage, buy, sell, and trade information. ANSI has approved a set of EDI standards known as the X12 standards. Theses standards play a necessary condition for organizations to join EDI community. Moreover, the X12 standards developed uniform standards for inter industry electronic exchange of business and managements transactions electronic data interchange (EDI). EDI standards used as a national format based on the organization location and activity. Each international format is an international EDI standard designed to meet the needs of both WSEAS TRANSACTIONS on COMPUTERS Zakaria Suliman Zubi ISSN: 1109-2750 832 Issue 8, Volume 9, August 2010