Organizations

    • Done

      • DONE Review test cases again (50%)
    journal Created Tue, 26 Aug 2025 00:00:00 +0000
  • Note

    • https://www.reddit.com/r/vibecoding/comments/1myakhd/how_we_vibe_code_at_a_faang/
      • Always start with a solid design document and architecture.
      • Build from there, always write tests first.
      • It’s not too different from how Holistics currently works.
      • Implies that the best way to “vibe code” is not to vibe code at all.
      • I am slowly letting go of the idea that AI can plan and code 100% without human intervention.
        • Looking back, my thesis, which was 100% AI-generated, is total trash.
    • Done

      • DONE Write document for total_items in mart_product__tenant_features_usage_monthly
    journal Created Mon, 25 Aug 2025 00:00:00 +0000
  • Done

    • DONE Add dbt model for event tracking tagging system

    • DONE Collect AI issue

    journal Created Fri, 22 Aug 2025 00:00:00 +0000
  • journal Created Thu, 21 Aug 2025 00:00:00 +0000
    • Done

      • DONE Request a certified copy of the birth certificate extract
    journal Created Mon, 18 Aug 2025 00:00:00 +0000
    • Note

      • Options for providing data to AI:
        • Metadata: titles, labels, descriptions, text blocks, data types, model relationships, formulas, chart settings, AMQL code, git commit messages.
        • Sample data: a few sampled values from the source database columns.
        • Result data: the output of the generated chart, as CSV or images.
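        • Sketch: one way the three options above could be bundled into a single context payload. Everything here is an assumption for illustration (the `AIContext` class, its field names, and the `sample_values` helper are hypothetical, not an existing Holistics API).

```python
import json
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class AIContext:
    """Hypothetical container for the three kinds of data given to the AI."""
    metadata: dict = field(default_factory=dict)     # titles, labels, data types, ...
    sample_data: dict = field(default_factory=dict)  # column name -> a few sampled values
    result_data: Optional[str] = None                # e.g. CSV text of the chart result


def sample_values(values, k=3):
    """Return up to k distinct non-null values, in first-seen order."""
    seen = []
    for v in values:
        if len(seen) >= k:
            break
        if v is not None and v not in seen:
            seen.append(v)
    return seen


def to_prompt_json(ctx):
    """Serialize the context deterministically so prompts are reproducible."""
    payload = {
        "metadata": ctx.metadata,
        "sample_data": ctx.sample_data,
        "result_data": ctx.result_data,
    }
    return json.dumps(payload, indent=2, sort_keys=True)


# Illustrative usage with made-up column values.
ctx = AIContext(
    metadata={"title": "Monthly orders", "data_types": {"status": "text"}},
    sample_data={"status": sample_values(["paid", "paid", None, "refunded", "open", "void"])},
    result_data="month,orders\n2025-07,10\n",
)
prompt_json = to_prompt_json(ctx)
```

        • Keeping the three parts as separate fields makes it easy to test each option on its own (metadata only, metadata plus samples, and so on).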
    • Done

      • DONE Fix wrong tests
    journal Created Fri, 15 Aug 2025 00:00:00 +0000
    • Done

      • DONE Fix wrong AQL in tests
    journal Created Thu, 14 Aug 2025 00:00:00 +0000
    • Note

      • https://openai.com/index/introducing-swe-bench-verified/
      • Revise the goals of benchmarking:
        • P1 Internal: Track our AI’s improvement.
        • P1 External: Communicate results (first to internal Holistics stakeholders: Sales team, Product team; then to prospects & customers).
      • This is divided into many tasks:
        • Creating test cases - I’m working on this.
        • Evaluation method.
        • Evaluation pipeline.
        • Presenting evaluation result.
      • Goals of creating a test suite for benchmarking:
        • Make a gold standard for evaluating agents’ ability to work with data-related business problems, rather than generating SQL/AQL snippets (like text-to-SQL models do).
        • Have a measurable indicator of how good our AI is (for example, better support for analytics techniques such as running total, percent of total, and trend analysis; fewer lines of code) that correlates with real data analyst productivity.
        • Make it easy to compare Holistics’s AI with ChatGPT, text-to-SQL models, or competitors (?).
        • Identify capability gaps (where all agents fail).
      • Expected one-line pitch:
        • Our latest AI agent can now successfully handle 95% of common analytical patterns like Period-over-Period comparisons and 85% of complex multi-step Cohort analyses, representing a 20% quarterly improvement in advanced analytical support, according to our benchmark dataset.
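      • Sketch: how the per-pattern numbers in a pitch like this, and the "capability gaps" goal, might be computed from raw test results. The record layout `(agent, case_id, pattern, passed)`, the function names, and the sample records are all assumptions for illustration, not actual benchmark data or the real evaluation pipeline.

```python
from collections import defaultdict


def pass_rate_by_pattern(results):
    """Return {agent: {pattern: pass_rate}} from (agent, case_id, pattern, passed) records."""
    counts = defaultdict(lambda: defaultdict(lambda: [0, 0]))  # [passed, total]
    for agent, _case_id, pattern, passed in results:
        cell = counts[agent][pattern]
        cell[0] += int(passed)
        cell[1] += 1
    return {
        agent: {pattern: p / total for pattern, (p, total) in patterns.items()}
        for agent, patterns in counts.items()
    }


def capability_gaps(results):
    """Return the sorted case ids that no agent passed."""
    passed_by_case = defaultdict(bool)
    for _agent, case_id, _pattern, passed in results:
        passed_by_case[case_id] = passed_by_case[case_id] or passed
    return sorted(case for case, ok in passed_by_case.items() if not ok)


# Made-up records, only to show the shape of the input.
RESULTS = [
    ("agent_a", "c1", "period_over_period", True),
    ("agent_a", "c2", "period_over_period", False),
    ("agent_a", "c3", "cohort", False),
    ("agent_b", "c1", "period_over_period", True),
    ("agent_b", "c2", "period_over_period", True),
    ("agent_b", "c3", "cohort", False),
]

rates = pass_rate_by_pattern(RESULTS)
gaps = capability_gaps(RESULTS)
```

      • Running the same functions on results from two benchmark runs would give the quarter-over-quarter change; how a test case is judged as passed is left to the evaluation method task above.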

    journal Created Tue, 12 Aug 2025 00:00:00 +0000
  • Done

    journal Created Mon, 11 Aug 2025 00:00:00 +0000
  • journal Created Sun, 10 Aug 2025 00:00:00 +0000