semantic similarity

텍스트 내 단어 끼리의 similarity를 정량화한 것

비슷한 단어끼리 그룹화 가능
natural language understanding에서 블록을 만들 수 있음
- textual entailment(텍스트 전체의 내용을 반영하여 텍스트 내부의 sentence 의미 추론)
- paraphrasing

**1. Word Net이란? **

영어의 의미 어휘 목록. 영어 단어의 의미관계를 저장해 놓은 데이터베이스라고 생각하면 됨.

3. Lin Similarity

find similariy between 2 means

path similarity - find the path between the two concepts

similariy measure inversly related to path distance

elk&deer - distance 1 - pathsim(1/1+1)

elk&giraffe - distance 2 -pathsim 1/(1+2)

Lowest common subsumer(LCS)

deer&giraffe - ruminant

deer&elk = deer

Lin Similarity

based on the information contained in the LCS of the 2 concepts

python

deer.n.01: first syntax(noun) meaning of deer

path_similarity

not distance

collocations and distributional similarity

distributional similarity

strength of association between words

topic modeling

topic modeling 6:00

what’s known

(1,(2,(3

PLSA // LDA

Nlp Topic_modeling