How do you calculate the similarity score? Is it PageRank or something like it? And are you using a graph db like neo4j? (Which has powerful tools for calculation of various similarity scores)