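Cosine similarity is widely used in problems such as computing text similarity, and scikit-learn provides convenient ways to call it. The first way is to use cosine_similarity: when a single variable a is passed in, the entry in row i, column j of the returned array is the cosine similarity between a[i] and a[j]. Example: from sklearn.metrics.pairwise import ...

sklearn.metrics.pairwise.cosine_similarity(X, Y=None, dense_output=True)

Cosine similarity, or the cosine kernel, computes similarity as the normalized dot product of X and Y:

$$K(X, Y) = \frac{\langle X, Y \rangle}{\lVert X \rVert \, \lVert Y \rVert}$$

On L2-normalized data, this function is equivalent to linear_kernel.

Cosine similarity is a measure of similarity, often used to measure document similarity in text analysis, where the normalized dot product is applied to pairs of document vectors.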
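The single-argument call and the L2-normalization remark can be illustrated with a minimal sketch. The array `a` and its values are hypothetical toy data, not taken from the original example, and `normalize` from `sklearn.preprocessing` is used here only as one convenient way to produce unit-length rows.

```python
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity, linear_kernel
from sklearn.preprocessing import normalize

# Hypothetical toy data: three 3-dimensional row vectors.
a = np.array([
    [1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0],
    [0.0, 0.0, 2.0],
])

# With a single argument, sim[i, j] is the cosine similarity of a[i] and a[j].
sim = cosine_similarity(a)

# Manual check of one entry: dot product divided by the product of the norms.
manual_01 = a[0] @ a[1] / (np.linalg.norm(a[0]) * np.linalg.norm(a[1]))

# After scaling every row to unit L2 norm, the plain dot product
# (linear_kernel) should reproduce the same matrix.
a_unit = normalize(a, norm="l2")
sim_via_linear = linear_kernel(a_unit)

print(sim)
print(np.isclose(sim[0, 1], manual_01))
print(np.allclose(sim, sim_via_linear))
```

The result is a square matrix with one row and one column per input row. If the data is already L2-normalized, calling `linear_kernel` directly skips the normalization step that `cosine_similarity` would otherwise repeat.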