Author: Wanis, Bassem Victor Fekry./ Title: Towards Enhancing The Accuracy of Clustering XML Documents/

Search In this Thesis

العنوان

Towards Enhancing The Accuracy of Clustering XML Documents/

المؤلف

Wanis, Bassem Victor Fekry.

هيئة الاعداد

مشرف / باسم فكتور فكرى و نيس

مشرف / محمد سعيد حلمى أبوجبل

مشرف / نجوى مصطفى إسماعيل المكى

مناقش / مجدى حسين محمود راتب ناجى

مشرف / أمانى أنور أحمد سعد

الموضوع

Computer Science.

تاريخ النشر

2011 .

عدد الصفحات

78 p. :

اللغة

الإنجليزية

الدرجة

ماجستير

التخصص

الهندسة الكهربائية والالكترونية

تاريخ الإجازة

1/6/2011

مكان الإجازة

جامعة الاسكندريه - كلية الهندسة - حاسب الى

الفهرس

Only 14 pages are availabe for public view

from

Abstract

With the continuous growth of XML documents, clustering of these documents has be¬come an active research area. This thesis proposes a novel technique that explores both the content and structure of XML documents for determining similarity among them. As the content and the structure of XML documents play different roles and have different impor¬tance depending on the use and purpose of a dataset, the proposed technique separates the content similarity process from the structure similarity process, and then uses appropriate weights to combine the content similarity and the structure similarity based on the type of XML documents. The proposed technique can be configured to target both rigorously struc¬tured fine-grained XML documents and loosely structured coarse-grained XML documents. It can also be configured to target both homogenous and heterogeneous XML documents. Several experiments were conducted to evaluate the accuracy and the scalability of the pro¬posed technique and to compare it with state-of-the-art techniques. The results show the effectiveness of the proposed technique.