This blog provides users with various sources and information on Arabic blogs.

Sunday, August 5, 2007

Arabic blogs dataset for free download

the Arabic blogs datset consists of approximately 12.000 Arabic blogs containing a number of posts that exceeds 120.300 posts. The oldest blog post in the dataset dates back to 2002. The dataset contains also relevant blog data.

The blog data consist of information about blogs such as title, description, and URL address. In addition, it consists of blog posts data like author's name, date of publication, and content of the post.

To download the Arabic blogs dataset from the page of Arabic blogs at the ILPS, University of Amsterdam. Click here

You can also download the dataset from my home page by clicking to the following link