COMP SCI 7094 Distributed Databases and Data Mining Assignment
In this assignment you are required to implement the Bond Energy Algorithm (BEA) for vertical
fragmentation. Your code should contain two separate procedures, AA Generator and CA Generator.
AA Generator takes as input all attributes of a relation, a set of queries, and the queries'
access frequencies at different sites, and outputs an attribute affinity matrix AA. CA Generator
takes an affinity matrix AA as input and produces a clustered affinity matrix CA. For a
description of the BEA algorithm and the definitions of AA and CA, please see the lecture
slides or textbook.
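
As a rough illustration of what CA Generator has to do, the following is a minimal sketch of the column-placement loop of the BEA as it is usually presented in textbooks. It is not a reference solution. The names `bond`, `cont` and `ca_generator` are hypothetical, the matrix is assumed to be a square list of lists, and ties between equally good positions are assumed to go to the leftmost one; check the lecture slides for the exact conventions expected in your submission.

```python
def bond(aa, x, y):
    """Bond between attribute columns x and y.

    None stands for the empty boundary to the left of the first column
    or to the right of the last one, where the bond is taken to be 0.
    """
    if x is None or y is None:
        return 0
    return sum(row[x] * row[y] for row in aa)


def cont(aa, left, mid, right):
    """Net contribution of placing column mid between left and right."""
    return (2 * bond(aa, left, mid)
            + 2 * bond(aa, mid, right)
            - 2 * bond(aa, left, right))


def ca_generator(aa):
    """Return (order, ca) for a square affinity matrix aa.

    order lists the original attribute indices in clustered order;
    ca is aa with rows and columns permuted into that order.
    """
    n = len(aa)
    order = list(range(min(n, 2)))  # the first two columns are placed as-is
    for new in range(2, n):
        best_pos, best_gain = 0, None
        # Try every slot, including the far left and the far right.
        for pos in range(len(order) + 1):
            left = order[pos - 1] if pos > 0 else None
            right = order[pos] if pos < len(order) else None
            gain = cont(aa, left, new, right)
            if best_gain is None or gain > best_gain:  # assumption: leftmost wins ties
                best_pos, best_gain = pos, gain
        order.insert(best_pos, new)
    # Apply the same permutation to the rows as to the columns.
    ca = [[aa[r][c] for c in order] for r in order]
    return order, ca


if __name__ == "__main__":
    # Illustrative 4x4 affinity matrix (sample values only).
    sample_aa = [
        [45, 0, 45, 0],
        [0, 80, 5, 75],
        [45, 5, 53, 3],
        [0, 75, 3, 78],
    ]
    order, ca = ca_generator(sample_aa)
    print(order)
    for row in ca:
        print(row)
```

With the sample matrix above, the sketch places the attributes in the order 0, 2, 1, 3 (0-based indices), so the two strongly related pairs end up next to each other.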
In this assignment, attribute affinity is measured by the extended Otsuka-Ochiai coefficient
instead of the traditional method described in the textbook. The following equations show the
details of the computation, where q is the number of attributes, m is the number of sites, and
Aik is the number of times attribute Ai is accessed by query qk, summed over all sites. The
result of the division must be rounded up to the nearest integer. (Using DOUBLE instead of
FLOAT during the calculation may help you get the correct result.)
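
The two mechanical steps named above, building the Aik counts and rounding a division up, could be sketched as follows. This is only an illustration under stated assumptions: `access_counts` and `round_up_division` are hypothetical helper names, `use[k][i]` is assumed to be 1 when query qk accesses attribute Ai and 0 otherwise, and `freq[k][l]` is assumed to be the access frequency of query qk at site l. The coefficient itself is deliberately not implemented here; take it from the assignment's equations.

```python
import math


def access_counts(use, freq):
    """Return A where A[i][k] is the number of times attribute i is
    accessed by query k, summed over all sites.

    use[k][i]  : 1 if query k accesses attribute i, else 0 (assumed layout)
    freq[k][l] : access frequency of query k at site l (assumed layout)
    """
    n_queries = len(use)
    n_attrs = len(use[0]) if use else 0
    return [
        [use[k][i] * sum(freq[k]) for k in range(n_queries)]
        for i in range(n_attrs)
    ]


def round_up_division(numerator, denominator):
    """Divide in double precision and round the result up to an integer.

    Python's float is an IEEE 754 double, which matches the DOUBLE hint.
    """
    return math.ceil(numerator / denominator)


if __name__ == "__main__":
    # Illustrative data: 4 queries, 4 attributes, 3 sites (sample values only).
    use = [
        [1, 0, 1, 0],
        [0, 1, 1, 0],
        [0, 1, 0, 1],
        [0, 0, 1, 1],
    ]
    freq = [
        [15, 20, 10],
        [5, 0, 0],
        [25, 25, 25],
        [3, 0, 0],
    ]
    for row in access_counts(use, freq):
        print(row)
    print(round_up_division(7, 2))
```

In languages that distinguish the two types, doing the square root and division in single precision can move a result across an integer boundary before the round-up, which is presumably why the DOUBLE hint is given.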