Hierarchical characterization and generation of blogosphere workloads

Mariela Josefina Curiel Huérfano, Azer Bestavros, Fernando Duarte, Bernardo Mattos, Jussara Almeida, Virgilio Almeida

Producción: Contribución a una revistaArtículo

Resumen

We present a thorough characterization of the access patterns in blogspace, which comprises a rich interconnected web of blog postings and comments by an increasingly prominent user community that collectively define what has become known as the blogosphere. Our characterization of over 35 million read, write, and management requests spanning a 28-day period is done at three different levels. The user view characterizes how individual users interact with blogosphere objects (blogs); the object view characterizes how individual blogs are accessed; the server view characterizes the aggregate access patterns of all users to all blogs. The more-interactive nature of the blogosphere leads to interesting traffic and communication patterns, which are different from those observed for traditional web content. We identify and characterize novel features of the blogosphere workload, and we show the similarities and differences between typical web server workloads and blogosphere server workloads. Finally, based on our main characterization results, we build a new synthetic blogosphere workload generator called GBLOT, which aims at mimicking closely a stream of requests originating from a population of blog users. Given the increasing share of blogspace traffic, realistic workload models and tools are important for capacity planning and traffic engineering purposes.
Idioma originalInglés
Páginas (desde-hasta)1-34
Número de páginas34
PublicaciónComputer Science, Boston University, Tech. Rep
EstadoPublicada - 2008
Publicado de forma externa

Huella

Profundice en los temas de investigación de 'Hierarchical characterization and generation of blogosphere workloads'. En conjunto forman una huella única.

Citar esto