WebClass is a prototypical workbench written in
Java for experimenting the application of Statistical and Case-Based
Reasoning methods to automatic Web page classification.
Platform : Any Java 1.1 (or higher) enabled
Installation Procedure & Testing
The distribution package webclass.zip contains the following files:
To install, make the following steps:
o Unzip the file webclass.zip into a directory you have chosen
o Go to this directory
o Extract webclass.jar with command jar -xfM webclass.jar
o Extract html.jar with command jar -xfM html.jar
o Extract experiment1.jar with command jar -xfM experiment1.jar
o Extract experiment2.jar with command jar -xfM experiment2.jar
The system User Manual and other
documentation is not yet available. Anyway you can test the system by doing
the experiments described hereunder. The system is not "stable" (it
is written by our university students and its main aim is to
"demonstrate" hopefully good ideas). Therefore, please be patient
for slowly running or for eventually system errors.
The First Experiment
You can explore the system features by using
the experiment 1 knowledge base: for example, you can classify a Web page by
selecting the menu-item "Page" from the pop-up menu
"Classify". A micro-browser will appear allowing you to load a web
page from the directory html (buttons: openfile and reload) and classify it by pushing one of the
classification buttons: Classify by Centroids, Classyfy by NN, Classify by
The Second Experiment
We started to embed the best results produced
by WebClass into WBI plug-ins with the aim of building
"intelligent proxy servers".
· F. Esposito, D. Malerba, L. Di Pace, & P. Leo (2000). A Machine Learning Approach to Web Mining, In E. Lamma & P. Mello (Eds.), AI*IA 99: Advances in Artificial Intelligence, Lecture Notes in Artificial Intelligence, 1321, 439-442, Springer, Berlin, Germany.
· F. Esposito, D. Malerba, L. Di Pace, & P. Leo (2000). WebClass: An Intermediary for the Classification of HTML Pages, Demo paper for AI*IA '99, Bologna, Italy.
· G. Convertino, L. di Pace, P. Leo, A. Maffione, D. Malerba & G. Vespucci. Tecniche di Web Mining per supportare l'attivitą di navigazione in rete, Proceedings of AICA '98, 53-74, Naples, Italy.
None yet available. Send all
requests/comments to: Pietro Leo, IBM Java Technology
Center, Bari (Italy).
Last modified 10/01/2000