AI Token Counter

Count tokens for GPT-4o & Gemini models

Instantly count tokens for GPT-4o and Gemini models. See context window usage and estimated API cost in real time — no sign-up needed.


What is an AI Token?

A token is the basic unit that large language models (LLMs) process. Tokens are not the same as words — they can be parts of words, whole words, punctuation, or even spaces. On average, 1 token ≈ 4 characters or ¾ of a word in English, but this varies significantly by language and content type.
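The rule of thumb above can be turned into a quick estimator. This is a heuristic sketch, not a real tokenizer; the function name and the averaging of the two ratios are illustrative:

```python
def estimate_tokens(text: str) -> int:
    """Rough English token estimate: average of the two common
    rules of thumb (1 token ~ 4 characters, 1 word ~ 4/3 tokens)."""
    by_chars = len(text) / 4
    by_words = len(text.split()) * 4 / 3
    return round((by_chars + by_words) / 2)

print(estimate_tokens("Tokens are not the same as words."))  # roughly 9
```

Expect the estimate to drift for code, non-English text, or heavy punctuation; only the model's actual tokenizer gives exact counts.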

Every AI API charges by the token and enforces a context window limit — the maximum number of tokens the model can process in a single request (including your prompt and its response). Knowing your token count before sending a request helps you avoid unexpected costs and context overflow errors.

This tool uses the o200k_base tokenizer (used by GPT-4o and GPT-4o mini) for exact counts on OpenAI models, and a close approximation for Gemini models. Click Get token count when a Gemini model is selected to get precise numbers via Google's API.

How to Use This Tool

  1. Paste your prompt, system message, or any text into the input area.
  2. Select the model you want to count tokens for from the dropdown.
  3. For GPT-4o models, token count, context window usage, and cost update instantly as you type.
  4. For Gemini models, click 'Get token count' to fetch exact numbers via Google's API.
  5. Switch models at any time — select a new model and the result updates immediately.

Common Use Cases

Staying Within Context Limits

Check if your prompt plus conversation history fits within the model's context window before hitting a 400 error.

Estimating API Costs

Calculate input token costs across models before choosing which to use for a high-volume task.

Comparing Models

Switch between GPT-4o and Gemini to see how token counts differ for the same text.

Optimizing System Prompts

Reduce token count in your system prompt to leave more room for conversation without cutting context.

RAG Chunk Sizing

Measure how many tokens your retrieval chunks use to set the right chunk size for your vector store.
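A minimal chunking sketch using the ~4 characters/token heuristic (a production pipeline would count with the model's actual tokenizer; function and parameter names here are illustrative):

```python
def chunk_by_tokens(text: str, max_tokens: int = 512,
                    chars_per_token: int = 4) -> list[str]:
    """Split text into chunks of roughly max_tokens each, using the
    ~4 chars/token heuristic. Splits on whitespace, never mid-word."""
    max_chars = max_tokens * chars_per_token
    chunks, current = [], ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            chunks.append(current)   # current chunk is full; start a new one
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks

docs = chunk_by_tokens("lorem ipsum " * 2000, max_tokens=256)
print(len(docs), "chunks of ~", len(docs[0]) // 4, "tokens each")
```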

Fine-tuning Dataset Prep

Estimate token counts across your training examples to plan fine-tuning costs and stay within limits.
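As a sketch, the same ~4 characters/token heuristic can be summed over a dataset; the examples, epoch count, and training price below are hypothetical placeholders, not real rates:

```python
# Estimate total training tokens for a fine-tuning dataset.
examples = [
    {"prompt": "Summarize: ...", "completion": "A short summary."},
    {"prompt": "Translate to French: hello", "completion": "bonjour"},
]

def approx_tokens(s: str) -> int:
    # ~4 characters per token; at least 1 token for any non-trivial string
    return max(1, len(s) // 4)

total = sum(approx_tokens(ex["prompt"]) + approx_tokens(ex["completion"])
            for ex in examples)
epochs = 3
price_per_million = 25.00  # hypothetical training price, USD per 1M tokens
cost = total * epochs * price_per_million / 1e6
print(f"~{total} tokens/epoch, ~${cost:.6f} for {epochs} epochs")
```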

Frequently Asked Questions

What's the difference between input and output tokens?

Input tokens are everything you send to the model: your prompt, any system instructions, conversation history, and context. Output tokens are the tokens the model generates in its response. Most APIs charge differently for input vs output — output tokens typically cost more per token. For example, GPT-4o charges $2.50 per 1M input tokens but $10.00 per 1M output tokens, so a long response can be expensive.
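Plugging in the GPT-4o prices quoted above, the asymmetry is easy to see in a few lines (the token counts are illustrative):

```python
INPUT_PRICE = 2.50    # USD per 1M input tokens (GPT-4o)
OUTPUT_PRICE = 10.00  # USD per 1M output tokens (GPT-4o)

def request_cost(input_tokens: int, output_tokens: int) -> float:
    return input_tokens / 1e6 * INPUT_PRICE + output_tokens / 1e6 * OUTPUT_PRICE

# A 2,000-token prompt with a 1,000-token response: the output half,
# though shorter, accounts for two thirds of the bill.
print(f"${request_cost(2_000, 1_000):.4f}")  # $0.0050 in + $0.0100 out = $0.0150
```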

How many tokens is 1000 words?

Roughly 1,300–1,500 tokens for typical English prose. Code typically produces more tokens per word because of special characters, punctuation, and identifiers. Non-English languages (especially CJK scripts) can use 2–3x more tokens per character than English.


What happens if I exceed the context window?

The API will return an error (typically a 400 with a message about context length). Some SDKs will silently truncate older messages instead. Always check your total token count — prompt + conversation history + expected response — against the model's context limit.
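That check can be done before sending the request. A minimal pre-flight sketch against GPT-4o's 128K context window (the token counts are illustrative):

```python
CONTEXT_WINDOW = 128_000  # GPT-4o context window, in tokens

def fits(prompt_tokens: int, history_tokens: int,
         max_output_tokens: int) -> bool:
    """True if prompt + history + reserved response room fit the window."""
    return prompt_tokens + history_tokens + max_output_tokens <= CONTEXT_WINDOW

print(fits(4_000, 120_000, 2_000))  # 126,000 total: fits
print(fits(4_000, 123_000, 2_000))  # 129,000 total: would overflow
```

Reserving room for `max_output_tokens` matters: a prompt that fits exactly leaves the model no space to respond.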

Does this tool send my text to a server?

For GPT models, no — counting runs entirely in your browser. For Gemini exact counts, your text is sent to our server which calls Google's token counting API. Your text is not stored or logged.

Which tokenizer does GPT-4o use?

GPT-4o and GPT-4o mini use the o200k_base encoding, which has a larger vocabulary than the older cl100k_base used by GPT-4 and GPT-3.5. This tool uses o200k_base for exact counts on OpenAI models.

Want to learn more? Read our guide: What Are AI Tokens? How GPT-4o, Gemini, and Claude Count Them