Skip to content
@CLaiM-team

CLaiM

CLaiM

RAG ๊ธฐ๋ฐ˜ ๋ณดํ—˜์ฒญ๊ตฌ์‹ฌ์‚ฌ ์ž๋™ํ™” ์„œ๋น„์Šค

image

ํ”„๋กœ์ ํŠธ ๊ฐœ์š”

"CLaiM"์€ RAG(Retrieval-Augmented Generation) ๊ธฐ๋ฐ˜ ์ฒญ๊ตฌ์‹ฌ์‚ฌ ์ž๋™ํ™” ์„œ๋น„์Šค๋กœ, ์‚ฌ์šฉ์ž์˜ ๋ณดํ—˜ ์ฒญ๊ตฌ ์ •๋ณด์— ๋Œ€ํ•ด ์ €์žฅ๋œ ์œ ์‚ฌํ•œ ์•ฝ๊ด€์„ ๊ฒ€์ƒ‰ํ•˜๊ณ  ๋ถ„์„ํ•˜์—ฌ ์‹ฌ์‚ฌ ์˜๊ฒฌ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

์ฃผ์š” ํŠน์ง•

๐Ÿ” ์‚ฌ์šฉ์ž๋ณ„ ๋ณดํ—˜ ๋‚ด์—ญ ๊ด€๋ฆฌ ์‹œ์Šคํ…œ

  • ๋…๋ฆฝ์ ์ธ ๋ณดํ—˜ ๋‚ด์—ญ ๊ด€๋ฆฌ: ๊ฐ ์‚ฌ์šฉ์ž๋ณ„๋กœ ๋ณ„๋„์˜ ๋ณดํ—˜ ์•ฝ๊ด€ ๋“ฑ๋ก ๋ฐ ๊ด€๋ฆฌ
  • ์‚ฌ์šฉ์ž๋ณ„ ๋ฒกํ„ฐ์Šคํ† ์–ด: faiss_db/user_{user_id}_{insurance_type} ๊ตฌ์กฐ๋กœ ๊ฒฉ๋ฆฌ

๐Ÿค– AI ๊ธฐ๋ฐ˜ ์‹ฌ์‚ฌ ๋ฐ ์š”์•ฝ ์ž๋™ํ™”

  • ํ•œ์ค„ ์š”์•ฝ: "์‹ฌ์‚ฌ ํ†ต๊ณผ ํ™•๋ฅ  ๋†’์Œ/๋ณดํ†ต/๋‚ฎ์Œ" ํ˜•ํƒœ๋กœ ์ฆ‰์‹œ ํ™•์ธ ๊ฐ€๋Šฅ
  • LLM ๋ถ„์„: OpenAI/Qwen ๋ชจ๋ธ์„ ํ†ตํ•œ ์ง€๋Šฅ์ ์ธ ์Šน์ธ ๊ฐ€๋Šฅ์„ฑ ํŒ๋‹จ

UX/UI

image

Architecture Diagram

image

API Sequence Diagram

image

ERD

image

์ฃผ์š” ๊ธฐ๋Šฅ

1. ์‚ฌ์šฉ์ž ๊ด€๋ฆฌ ์‹œ์Šคํ…œ

  • ์‚ฌ์šฉ์ž๋ณ„ ํ”„๋กœํ•„: ๋‚˜์ด, ์„ฑ๋ณ„, ๊ธฐ์กด ๋ณ‘๋ ฅ ์ •๋ณด ํฌํ•จ
  • ๋…๋ฆฝ์ ์ธ ์„ธ์…˜: ๊ฐ ์‚ฌ์šฉ์ž๋ณ„๋กœ ๋ณ„๋„์˜ ๋ณดํ—˜ ๋ฐ์ดํ„ฐ ๊ด€๋ฆฌ
  • SQLite ํ†ตํ•ฉ: ์‚ฌ์šฉ์ž ์ •๋ณด์™€ ๋ณดํ—˜ ๋ฐ์ดํ„ฐ ์˜๊ตฌ ์ €์žฅ

2. ๋ณดํ—˜ ์•ฝ๊ด€ ๊ด€๋ฆฌ

  • ์ข…๋ฅ˜๋ณ„ ๋ถ„๋ฅ˜: ์ƒ๋ช…๋ณดํ—˜, ์†ํ•ด๋ณดํ—˜, ์ž๋™์ฐจ๋ณดํ—˜ 3๊ฐœ ์นดํ…Œ๊ณ ๋ฆฌ
  • ์‚ฌ์šฉ์ž๋ณ„ ์—…๋กœ๋“œ: ๊ฐ ์‚ฌ์šฉ์ž๊ฐ€ ์ž์‹ ๋งŒ์˜ ์•ฝ๊ด€ ๋“ฑ๋ก ๊ฐ€๋Šฅ
  • PDF ์ฒ˜๋ฆฌ: ํ…์ŠคํŠธ ์ถ”์ถœ, ์ •๊ทœํ™”, ์ฒญํฌ ๋ถ„ํ• 
  • ๋ฒกํ„ฐ ์ €์žฅ: ์‚ฌ์šฉ์ž๋ณ„ FAISS ๋ฒกํ„ฐ์Šคํ† ์–ด ์ƒ์„ฑ ๋ฐ ๊ด€๋ฆฌ

3. ์ง€๋Šฅํ˜• ์‹ฌ์‚ฌ ์‹œ์Šคํ…œ

  • RAG ๊ฒ€์ƒ‰: ์ƒ์œ„ 3๊ฐœ ์œ ์‚ฌ ์•ฝ๊ด€ ๊ฒ€์ƒ‰ (k=3)
  • MMR ๋‹ค์–‘์„ฑ: Maximum Marginal Relevance๋กœ ๊ฒฐ๊ณผ ์ตœ์ ํ™”
  • AI ๋ถ„์„: GPT-4o / Qwen 1.5 ๊ธฐ๋ฐ˜ ์‹ฌ์‚ฌ ์˜๊ฒฌ ์ œ๊ณต
  • ์Šน์ธ ํ™•๋ฅ : ๋†’์Œ/๋ณดํ†ต/๋‚ฎ์Œ์œผ๋กœ ๊ฐ„๋‹จ ์š”์•ฝ



๊ธฐ์ˆ  ์Šคํƒ

  • Backend: FastAPI, Python 3.13, SQLAlchemy, SQLite
  • Frontend: Streamlit
  • AI/ML:
    • Embedding: OpenAI Embeddings, BAAI/BGE-M3 (์„ ํƒ ๊ฐ€๋Šฅ)
    • LLM: OpenAI GPT-4o, Qwen 1.5(0.5B) (์„ ํƒ ๊ฐ€๋Šฅ)
    • Vector DB: FAISS (์‚ฌ์šฉ์ž๋ณ„ ๋…๋ฆฝ ์ธ๋ฑ์Šค)
    • Framework: LangChain
  • Database: SQLite (์‚ฌ์šฉ์ž/๋ณดํ—˜ ๋ฐ์ดํ„ฐ)
  • PDF Processing: PyPDF (ํ…์ŠคํŠธ ์ถ”์ถœ ๋ฐ ์ •๊ทœํ™”)
  • Dependencies: uvicorn, requests, python-dotenv

๐Ÿ‘ฅ Contributors

Pinned Loading

  1. .github .github Public

    CLaiM : RAG ๊ธฐ๋ฐ˜ ๋ณดํ—˜ ์ฒญ๊ตฌ์‹ฌ์‚ฌ ์ž๋™ํ™” ์„œ๋น„์Šค

Repositories

Showing 2 of 2 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loadingโ€ฆ

Most used topics

Loadingโ€ฆ