Chinh (lelouvincx) / 2025-07-01

Created Tue, 01 Jul 2025 00:00:00 +0000 Modified Mon, 25 May 2026 06:02:25 +0000
82 Words
  • Note

    • Goal creating the benchmark:
      • Build trust - show the transparency of AI accuracy.
      • Have a sense of capacity (?).
      • Use this to informally evaluate Holistics AI vs. other tools.
    • https://www.sigmacomputing.com/blog/text-to-sql-data-chat
    • Current common benchmarks like Spider or BIRD is benchmarking for text-to-SQL problem, not the business-question-to-insight problem.
    • Idea: sync common AMQL questions from Zendesk.
  • Done

    • DONE Finalize the test suite approach
  • Read Holistics’s AI Philosophy.

  • Research common/standard GenBI benchmarking methods.

  • If not, go with text-to-SQL.

  • Write document and review with a Dat.