James Carpp, Principal, Rehmann
accountingaiplaybook.com
2025
Generative AI tools like ChatGPT and Microsoft 365 Copilot are transforming accounting workflows, but their effectiveness depends on how well you can guide them. Prompt engineering—the art of crafting clear, precise instructions—ensures these tools produce accurate, relevant outputs. In accounting, where precision is critical, a well-structured prompt can mean the difference between useful insights and misleading information.
In this chapter we will explore the common barriers to prompt engineering, strategies to overcome them, and best practices for effective prompts.
One of the most significant risks when using generative AI in accounting is the potential for “hallucinations,” where the AI generates plausible-sounding but incorrect information. For example, an AI might confidently misstate the criteria for tax deductions or provide an inaccurate interpretation of a financial regulation if not properly guided. This is especially dangerous in accounting, where errors can lead to compliance issues or financial misstatements.
AI models like ChatGPT have a fixed context window, meaning they can only process a certain number of words or tokens at a time. This can be a challenge when working with large documents, such as financial statements or tax guides. For example, if you try to summarize a 50-page lease agreement in a single prompt, the AI might cut off mid-sentence or produce an incomplete summary.
Many first-time users of generative AI fall into the trap of issuing vague or overly broad prompts, like “Analyze this financial report” or “Explain the new tax rules.” These generic prompts often yield generic responses, lacking the specificity required for real-world accounting tasks.
Another overlooked but important barrier is the uncertainty around how data entered into AI models is stored and potentially reused. Depending on the AI platform and its configuration, the information you enter may be used to improve future model performance. This means that sensitive financial data could unintentionally be retained or accessed beyond the current session. The risk is especially high when using publicly available or consumer-facing versions of AI tools.
Accountants should be careful when entering client records, tax information, or financial statements into AI tools. Always review the platform’s terms of use and enable privacy settings to prevent data from being used for training. For high-risk scenarios, choose enterprise-grade tools with data protection features like encryption and session isolation. A clear internal policy for how AI tools are used is just as important as crafting a good prompt.
The foundation of good prompt engineering is clarity. Vague or open-ended prompts often yield generic responses, while precise, well-structured prompts guide the AI to produce targeted, relevant results. For example, instead of asking, “Analyze this financial report,” which lacks context, try, “Identify three key financial risks in this Q4 financial statement, explain their potential impact on cash flow, and suggest mitigation strategies.” This more detailed prompt provides a clear task, context, and desired format, reducing the likelihood of vague or incomplete responses.
Effective prompts also benefit from a structured format. Breaking complex requests into steps or bullet points helps the AI follow your instructions more closely. For instance, a prompt like, “1. Summarize the key findings in this audit report. 2. Highlight any potential compliance issues. 3. Recommend next steps for remediation,” clearly defines the scope and order of the response, reducing ambiguity.
Finally, be specific about the output format you expect. If you want a bullet list, a concise paragraph, or a client-ready email, say so in your prompt. This helps the AI generate content that is not only accurate but also immediately usable in your workflow.
Setting context and defining roles within your prompts dramatically improves the quality of AI responses. For instance, instead of simply asking, “Draft a summary,” try, “Act as a tax advisor and draft a summary of the key points from this 2024 tax memo for a small business client.” This approach helps the AI adopt the right tone and perspective, resulting in more relevant and targeted outputs.
Role-based prompts are especially effective in accounting, where different roles have distinct priorities – a CFO might focus on financial strategy, while an internal auditor emphasizes risk and compliance. For example, a CFO prompt could be, “As a CFO, evaluate the financial health of this company based on the attached balance sheet and income statement, and highlight any areas of concern.”
Additionally, specifying the intended audience can guide the AI’s tone and level of technical detail. If the response is for a non-technical audience, like a client or executive, make that clear to avoid overly complex language. Simple cues like, “Draft this in a client-friendly tone,” or, “Summarize this for a board presentation,” can ensure the output is clear, concise, and tailored to the intended reader.
Accuracy is critical in accounting, where even a small error can lead to costly mistakes or compliance issues. To reduce the risk of AI-generated inaccuracies or “hallucinations,” include specific instructions in your prompt that anchor the AI to known facts or sources. For example, you might add directives like: “Base your response only on the data provided below,” or “If the answer isn’t in the attached text, do not supplement it with outside information.”
You can also instruct the AI not to fabricate information and to explicitly say if it’s unsure. Another effective technique is to have the AI cite its sources or reference specific documents (if you’ve provided excerpts or if the system has retrieval capabilities). For instance: “Provide ASC references for each accounting treatment you mention,” or “Include a footnote with the IRS publication number for any tax rules cited.” This approach forces the AI to stick to verifiable facts and makes it easier for you to check the response against authoritative materials.
In addition, consider asking the AI to explain its reasoning or show its work. Prompting it with “Show your calculation steps” or “Explain why you arrived at that conclusion, step by step,” can help expose any leaps in logic. You can even request the AI to “provide direct citations to sources and a brief explanation of your reasoning”. This way, you not only get an answer but also insight into how the AI got there, which you can then validate. If something looks off in the explanation or the sources seem unrelated, that’s a red flag that the response may not be trustworthy. As one expert put it, nobody wants a black-box answer; if an intern gave you a conclusion, you’d ask them to “prove it,” and you should do the same with AI.
However, even with these precautions, it’s essential to keep the human in the loop. AI outputs should always be received with a degree of skepticism, as no model is infallible. For high-stakes tasks, consider running the same prompt through multiple AI models or verifying outputs against trusted sources. Treat AI responses like the first draft of a junior staffer’s work – valuable, but in need of careful review before final use. Keeping the human in the loop is essential for maintaining professional standards and ensuring that critical decisions are based on accurate, contextually sound information.
Prompt engineering is an iterative process. Rarely will a complex query yield a perfect answer in one try, especially when dealing with nuanced accounting tasks. Treat each AI interaction as part of a back-and-forth conversation, where follow-up prompts are expected and necessary for refining the output.
For example, if your initial prompt yields an overly general response, try adding more context or asking the AI to focus on a specific aspect of the task. You might start with, “Summarize this client’s financial statements,” and follow up with, “Focus specifically on liquidity ratios and recent cash flow trends.” This iterative approach helps guide the AI toward more precise and actionable insights.
Additionally, if the AI’s response is too lengthy, overly technical, or missing key details, don’t hesitate to refine the prompt with clearer instructions. Simple follow-ups like, “Make this more concise,” or “Rephrase for a non-technical audience,” can dramatically improve the quality and relevance of the output. Treat the process like a collaborative dialogue, where each response is a stepping stone toward the final, polished result.
Modern AI platforms often include features that can help optimize prompts and improve the quality of responses. For example, ChatGPT offers a “Custom Instructions” feature that allows users to set persistent context, like your role and typical tasks, which the AI will consider in every response. This can be especially useful for accountants who want the AI to consistently adopt a particular tone or perspective, such as a tax expert or CFO.
Additionally, many AI tools allow you to upload documents or provide structured data, which can dramatically improve the quality of outputs. For example, if you’re asking the AI to analyze a financial statement, uploading the actual file instead of pasting raw text can provide more precise context and lead to better insights. Similarly, using file attachments in tools like Microsoft 365 Copilot or Google’s Gemini can reduce ambiguity and improve response accuracy by providing direct access to the relevant data.
Finally, keep in mind that many AI tools reset their context between sessions. If you’re working on a long project, consider saving effective prompts for reuse or creating a shared prompt library for your team. This practice can streamline workflows and ensure consistency across multiple projects.
Creating a prompt library, which is a collection of proven prompt templates, can help your team complete recurring accounting tasks with greater speed, accuracy, and consistency. Whether it is summarizing audit findings, drafting disclosures, or reviewing contracts, having go-to prompts ensures that less experienced staff start with a solid foundation instead of guessing. These templates can save time, reduce mistakes, and improve overall quality by capturing best practices refined by more senior team members.
To build your library, start with a shared document or internal wiki organized by task type or tool. Encourage your team to contribute prompts that have worked well, along with tips or variations. As AI tools and workflows evolve, treat the library as a living resource that gets updated, tested, and improved over time. This simple habit can turn prompt engineering from an individual skill into a firm-wide asset.
This checklist helps you reduce the risk of inaccurate AI responses by grounding outputs in reliable data, minimizing guesswork, and keeping results review-ready.
Provide Ground Truth Data – Include reference text or data in your prompt to keep responses accurate. (e.g., include a tax code snippet or financial statement excerpt.)
Ask for Citations or Evidence – Instruct the AI to cite sources for each fact or figure. Always verify these sources.
Instruct “Don’t Guess” – Be clear that the AI should not make up information if unsure. (e.g., “If uncertain, state that you do not have sufficient data.”)
Limit the Scope – Ask focused questions to avoid irrelevant or incorrect information. (e.g., instead of “Explain tax law,” try “Summarize the two main changes in tax law X for 2025 corporate filers.”)
Use Positive Framing – Tell the AI what to do, not just what to avoid. (e.g., “Use only the provided data.”)
Verify Numbers and Facts – Double-check critical numbers and facts against your documents. Never assume the AI is always correct.
Keep Human Review in the Loop – Always have a human review AI-generated content before sending it to clients or using it in financial reports.
Test the AI on Known Answers – Check the AI’s reliability by asking questions with known answers before trusting it with critical tasks.
For more advanced accounting workflows such as building custom AI assistants, handling technical memos, or ensuring strict compliance, we recommend using a structured approach to prompt design. The following framework outlines the key components of an expert-level prompt to help ensure accuracy, consistency, and alignment with professional standards.
Define the agent’s identity, domain, and user base.
“You are a technical accounting assistant focused exclusively on U.S. GAAP for private companies.”
Specify who the users are and what their needs may be.
“Your users include auditors, controllers, CFOs…”
Outline the authoritative sources and limits (e.g., ASC, PCC).
“Base all guidance on U.S. GAAP per the FASB ASC…”
Set tone, formatting, and how to explain technical material.
“Use bullets, tables, or step-by-step formats…”
List key scenarios the prompt is meant to support.
“Assist with: Drafting technical memos, Footnotes…”
How the AI should respond to vague input or gaps.
“If the question is vague, ask clarifying questions…”
Reinforce what not to do, and when to flag uncertainty or judgment.
“Do not generate journal entries unless…”
To illustrate how this framework comes together in practice, the following example shows a complete prompt designed for a real-world accounting use case, with each element tailored to support clarity, accuracy, and professional relevance.
You are a technical accounting assistant focused exclusively on U.S. GAAP for private companies.
Your users include auditors, controllers, CFOs, and financial reporting teams working with nonpublic entities.
You provide accurate, standards-based guidance grounded in the FASB Accounting Standards Codification (ASC) and relevant Private Company Council (PCC) alternatives. All answers must reflect practical application for private company financial reporting and audit support.
Now that you've seen the full framework and a complete example prompt, the next step is to apply these principles to everyday accounting tasks. The following scenarios represent common use cases where well-crafted prompts can save time, improve accuracy, and support better client communication. Each one can be used on its own or plugged into the advanced prompt template to create a more structured, repeatable workflow.
"You are a tax advisor specializing in small business tax law. Summarize the key points from the 2024 tax memorandum provided below. Focus specifically on the main changes in tax law discussed in the memo, and explain how each change could impact the cash flow of a small business. Base your summary only on the memo’s content (do not add any information that isn’t in the memo). Present the summary in a single concise paragraph that a small business owner could easily understand, using a friendly, non-technical tone.”
[Paste relevant section of the tax memo here, or include bullet points with key changes.]
This prompt sets a clear role (tax advisor), specifies the task (summarize tax changes), and includes the source material, ensuring the AI bases its response on the actual memo rather than general knowledge.
"Act as an internal auditor reviewing expenses. You will be given a Q1 expense report. Identify any entries that might violate the company's travel and expense policy – for example, charges that exceed allowed limits, entries lacking required receipts, or items that fall outside of approved categories. For each potential violation you find, list the transaction and provide a brief reason why it's flagged, along with a recommended follow-up action. If an entry appears compliant, you can ignore it. Provide the output as a bullet-point list for clarity."
[Include a few sample line items, such as transaction descriptions, amounts, and dates, or paste a portion of the expense report if available.]
This prompt provides the necessary context (role, report type, and policy focus) and includes the raw data the AI needs to evaluate. This reduces the risk of hallucinations and ensures the output is relevant to the actual report.
"You are an accounting manager drafting a professional email to Client X about the March 2025 financial close. The goal is to summarize the key financial metrics for the month, note any significant variances from the budget, and maintain a reassuring tone. Begin with a brief thank-you or positive note, then highlight the following metrics and their variances, providing a short explanation for why each variance occurred. Conclude by inviting the client to discuss the results in a follow-up call. Keep the entire email concise (around 150–200 words) and easy to understand for a non-financial reader."
● Revenue: $500,000 (Budget: $550,000)
● Gross Margin: 45% (Budget: 50%)
● Net Profit: $75,000 (Budget: $100,000)
● Key Variances: Revenue below budget due to delayed contracts, margin compression from rising supply costs.”
This version supplies the financial data the AI needs to draft a meaningful update, reducing the likelihood of vague or irrelevant responses. Including key figures and context helps the AI produce a more polished, client-ready email.
Mastering prompt engineering is essential for accounting professionals looking to leverage the full potential of AI tools like ChatGPT and Microsoft 365 Copilot. This chapter explores the biggest barriers to effective prompt engineering, including hallucinations, token limitations, and vague or poorly structured instructions. It also provides best practices for crafting precise, context-rich prompts that guide AI to produce accurate, relevant outputs. With practical tips on defining roles, maintaining human oversight, and structuring complex requests, along with interactive resources like a hallucination prevention checklist and sample prompts for common accounting scenarios, this chapter equips readers to generate high-quality AI responses with confidence.
© 2025 Accounting AI Playbook. All rights reserved.