Important
Must-know concepts that cover foundational Gen AI engineering topics.
3 Easy Concepts
18 Medium Concepts
5 Hard Concepts
Easy Concepts (3)
What are the basics of Python?
python, basics, fundamentals, data-structures, oop
How to use AI for day-to-day coding?
ai-tools, coding, productivity, prompt-engineering, cli
What are the current top-performing AI models as of May 2026? Compare their benchmark scores across key metrics.
gen-ai, benchmarks, models, comparison, 2026
Medium Concepts (18)
Explain in detail the OOP concepts in Python.
python, oop, classes, inheritance, encapsulation
Explain in detail the types of design principles in Python.
python, principles, solid, dry, kiss
Explain in detail the folder structure to follow in a Python project, along with a flow diagram.
python, folder-structure, project-structure, architecture, best-practices
How to use Dependency Injection in Python?
python, dependency-injection, di, fastapi, solid
LLM vs SLM: What is the difference between Large Language Models and Small Language Models? When to use which one in Python?
gen-ai, llm, slm, model-selection, cost-optimization
In Flutter, heavy computational tasks run in Isolates. What is the equivalent in Python? How do you run CPU-heavy tasks without blocking the main thread?
python, multiprocessing, concurrency, parallelism, process-pool
How to increase the performance of a Python application? Explain key optimization techniques with examples.
python, performance, optimization, profiling, caching
Are token counts directly proportional to context length? How do input and output tokens consume the context window?
gen-ai, llm, tokens, context-window
What is a context window, and how do you manage it in an LLM?
gen-ai, llm, context-window, tokens, rag
What is Mixture of Experts (MoE) in AI models? Example: Qwen 3.5-122B-A10B
gen-ai, mixture-of-experts, moe, architecture, qwen
What are the different model formats for LLMs?
gen-ai, llm, model-formats, gguf, gptq
What is Turbo Quant in ML/LLM?
gen-ai, quantization, turbo-quant, hqq, nf4
Does the speed of running local AI (LLM) models on a GPU depend more on the type of GPU used, or on how much VRAM the GPU has?
gen-ai, gpu, vram, local-llm, hardware
What are the different AI frameworks available in the market that are similar to MLX?
gen-ai, mlx, ai-frameworks, jax, pytorch
What are the various number formats used in AI models, and what do abbreviations like FP32, BF16, MXFP8, NVFP4, INT4, and Q4 mean?
gen-ai, quantization, number-formats, fp32, bf16
How do LLMs set their maximum context window? Explain the role of architecture, training, and API configuration.
gen-ai, context-window, llm, architecture, token-limits
Hard Concepts (5)
Explain in detail the architectural design patterns in Python, along with a flow diagram.
python, design-patterns, architecture, singleton, factory
In Flutter, state management handles the state of the application. Is there an equivalent in Python? If so, explain it along with a flow diagram.
python, state-management, contextvars, redis, langchain
How does NVFP4 differ from standard INT4 quantization in terms of hardware-level support and weight distribution handling for LLMs? Are there specific use cases where one significantly outperforms the other in accuracy?
gen-ai, quantization, nvfp4, int4, nvidia
What is the internal difference between INT4 and NVFP4? Which is better, what internal optimizations have been made, and does the choice relate to how the model is trained or to how it is used (for example, in an agent architecture)?
gen-ai, quantization, nvfp4, int4, nvidia
How do INT4 and NVFP4 each split their 4 bits, and what does that mean for precision distribution, activation handling, and the W4A16 vs W4A4 trade-off?
gen-ai, quantization, nvfp4, int4, w4a16