Clean Web Search (00015)
This dataset contains the results of comparing websearches across Bing, Google, Yahoo, and Ask. This data is provided by Robert Bredereck at TU Berlin. Robert provides tools to compute Kemeny rankings on this data at his website at TU Berlin.
These data files differ from the other set of web data in that these files are forced to be complete. This means that the results are restricted to only those candidates (sites) that appear in all three datasets. The data files marked big contain around 200 (max 242) candidates each while the data files marked small contain between 10 and 50 candidates. The search querys are shown in the names of the individual data files below. For the WebImpact files the number of search results for a particular term were used to creage a complete ranking over the search terms. These files measure the webimpact of various world cities and countries. We have extended this data into tournament graphs and weighted majoirty graphs.
Selected studies: N. Betzler, R. Bredereck and R. Niedermeier. Theoretical and empirical evaluation of data reduction for exact Kemeny Rank Aggregation. Autonomous Agents and Multi-Agent Systems, 28(5):721-748; 2014. | R. Bredereck. Fixed-Parameter Algorithms for Computing Kemeny scores - Theory and Practice. Thesis, Department of Mathematics and Computer Science, University of Jena, 2009. | N. Betzler, R. Bredereck, and R. Niedermeier. Partial Kernelization for Rank Aggregation: Theory and Experiments. Proc. 5th International Symposium on Parameterized and Exact Computation (IPEC), 2010.
Details
- Number of files: 79
- Total size: 417.27 KB
- Data types: soc
- Publication date: 2014-07-09