TGINSIGHT CHAT
Data Analytics
@sqlspecialist
EducationPerfect channel to learn Data Analytics Learn SQL, Python, Alteryx, Tableau, Power BI and many more For Promotions: @coderfun@love_data
Recent posts
Page 14 of 85 · 1,012 posts
Posted Feb 4
✅Data Analyst Interview Questions with Answers: Part-6 51. Difference between mean and median? Mean is the average. Median is the middle value. Example: Salaries - 20k, 22k, 25k, 30k, 1,00k Mean = 39.4k (skewed) Median = 25k (better representative) 52. What is standard deviation? It measures how spread out data is from the mean. Example: Avg sales = ₹10,000 Std dev = ₹500 → stable Std dev = ₹5,000 → volatile 53. What is variance? Square of standard deviation. Shows data spread mathematically. 54. What is correlation? Measures relationship between two variables. Range -1 to +1 Example: Ad spend vs sales = 0.8 → strong positive correlation. 55. Difference between correlation and causation? Correlation does not mean one causes the other. Example: Ice cream sales and drowning both increase in summer. 56. What is an outlier? A value far from others. Example: Order values - 500, 700, 800, 50,000 57. What is sampling? Using a subset of data to represent full dataset. Example: Survey 1,000 customers instead of 1 million. 58. What is distribution? Pattern showing how data values are spread. Example: Normal, skewed, uniform distributions. 59. What is skewness? Measures asymmetry of data. Example: Income data usually right-skewed. 60. When do you use median over mean? When data has outliers. Example: House prices, salaries. Double Tap ♥️ For Part-7
Posted Feb 3
✅Data Analyst Interview Questions with Answers: Part-5 41. What is data cleaning? Data cleaning is the process of fixing or removing incorrect, incomplete, or inconsistent data. Example: Removing duplicate customer records, Fixing wrong date formats. 42. How do you handle missing data? Common methods: - Remove rows (if few missing) - Replace with mean, median, or 0 - Use forward or backward fill Example (SQL): SELECT COALESCE(sales, 0) AS sales FROM orders; 43. How do you treat outliers? - Identify using sorting, box plots, or Z-score - Remove or cap extreme values Example: Sales = 10,000, 12,000, 15,000, 1,00,000 → outlier. 44. What is data normalization? Scaling data between 0 and 1. Example: Normalized value = (x - min) / (max - min) Used in ML and comparisons. 45. What is data standardization? Centers data around mean 0 with std dev 1. Example: Z = (x - mean) / std 46. How do you check data quality? - Accuracy - Completeness - Consistency - Validity - Timeliness Example: Sales should never be negative. 47. What is duplicate data? Same record appearing more than once. Example: Same customer ID repeated multiple times. 48. How do you validate source data? - Compare with source systems - Check row counts - Verify key metrics Example: Total revenue in report = total revenue in database. 49. What is data transformation? Converting data into usable format. Examples: - Converting dates - Creating new columns - Aggregating values 50. Why is data preparation important? Clean data = correct insights. Poor data leads to wrong decisions. Example: Wrong sales data → wrong business strategy. Double Tap ♥️ For Part-6
Posted Feb 2
Data Analyst Interview Questions with Answers: Part-4 31. What are Pivot Tables? Pivot tables summarize large datasets quickly. Example: Rows → Product, Values → Sum of Sales Result: Total sales per product in seconds. 32. Difference between VLOOKUP and XLOOKUP? VLOOKUP works left to right only. XLOOKUP works both ways and handles missing values better. Example: =XLOOKUP(A2, Products!A:A, Products!B:B) Fetches product name using product ID. 33. What is conditional formatting? Highlights data based on rules. Example: Highlight sales > 10000 in green. Helps spot top performers instantly. 34. What are COUNTIFS and SUMIFS? They apply conditions while counting or summing. Example: =SUMIFS(C:C, A:A, "East", B:B, "Laptop") Total sales of laptops in East region. 35. What is data validation? Restricts incorrect data entry. Example: Create dropdown for Region (East, West, North). Data → Data Validation → List. 36. How do you remove duplicates in Excel? Select data, Data → Remove Duplicates Example: Remove duplicate customer IDs. 37. What is IF formula used for? Applies logical conditions. Example: =IF(C2>5000,"High Sales","Low Sales") 38. Difference between relative and absolute reference? Relative → A2 changes when copied Absolute → $A$2 stays fixed Example: =A2*$E$1 Tax rate fixed while copying formula. 39. How do you clean data in Excel? Remove duplicates, TRIM extra spaces, Fix date formats, Handle blanks Example: =TRIM(A2) 40. What are common Excel mistakes analysts make? • Merged cells • Hard-coded values • No pivot tables • Poor formatting • No documentation Double Tap ♥️ For Part-5
Posted Feb 2
Posted Feb 2
Posted Feb 2
Posted Feb 2
Posted Feb 2
Posted Feb 1
Data Analyst Interview Questions with Answers: Part-3 21. What is SELECT used for? SELECT is used to fetch specific columns or data from a table. Example: SELECT customer_name, sales FROM orders; This query returns customer names and their sales from the orders table. 22. Difference between WHERE and HAVING? WHERE filters rows before aggregation. HAVING filters results after aggregation. Example: SELECT product, SUM(sales) AS total_sales FROM orders WHERE region = 'East' GROUP BY product HAVING SUM(sales) > 100000; Here, WHERE filters region first, HAVING filters aggregated sales. 23. What is GROUP BY? GROUP BY groups rows with the same values so aggregate functions can be applied. Example: SELECT region, SUM(sales) AS total_sales FROM orders GROUP BY region; This gives total sales per region. 24. What are aggregate functions? Aggregate functions perform calculations on multiple rows. Common examples: • COUNT → total rows • SUM → total value • AVG → average • MIN / MAX → smallest or largest value Example: SELECT COUNT(order_id), AVG(sales) FROM orders; 25. Difference between INNER JOIN and LEFT JOIN? INNER JOIN: Returns only matching records. LEFT JOIN: Returns all rows from left table and matching rows from right table. Example: SELECT o.order_id, c.customer_name FROM orders o LEFT JOIN customers c ON o.customer_id = c.customer_id; All orders appear even if customer info is missing. 26. What are subqueries? A subquery is a query inside another query. Example: SELECT * FROM orders WHERE sales > (SELECT AVG(sales) FROM orders); Returns orders with sales above average. 27. What is a CTE? CTE (Common Table Expression) is a temporary named result set that improves readability. Example: WITH sales_summary AS ( SELECT region, SUM(sales) AS total_sales FROM orders GROUP BY region ) SELECT * FROM sales_summary WHERE total_sales > 500000; 28. How do you handle duplicates in SQL? Identify duplicates: SELECT customer_id, COUNT(*) FROM orders GROUP BY customer_id HAVING COUNT(*) > 1; Remove duplicates (using ROW_NUMBER): DELETE FROM orders WHERE order_id IN ( SELECT order_id FROM ( SELECT order_id, ROW_NUMBER() OVER (PARTITION BY customer_id ORDER BY order_date) rn FROM orders ) t WHERE rn > 1 ); 29. How do you handle NULL values? Check NULL: SELECT * FROM orders WHERE sales IS NULL; Replace NULL: SELECT COALESCE(sales, 0) AS sales_amount FROM orders; 30. What are window functions? Window functions perform calculations across rows without grouping them. Example: SELECT customer_id, sales, ROW_NUMBER() OVER (PARTITION BY customer_id ORDER BY sales DESC) AS rn FROM orders; This ranks sales per customer without collapsing rows. Double Tap ♥️ For Part-4
Posted Jan 31
✅Data Analyst Interview Questions with Answers: Part-2 11. What is structured data? Structured data is organized in rows and columns with a fixed schema, making it easy to store and query using SQL. Example: Sales tables, customer databases. 12. What is semi-structured data? Semi-structured data does not follow a strict table format but contains tags or keys. Example: JSON files, XML data, API responses. 13. What is unstructured data? Unstructured data has no predefined format. Example: Emails, images, videos, customer reviews text. 14. What is a database? A database is an organized system used to store, manage, and retrieve data efficiently. Example: MySQL, PostgreSQL, SQL Server. 15. Difference between OLTP and OLAP? OLTP (Online Transaction Processing) → Handles daily transactions (e.g., orders, payments). OLAP (Online Analytical Processing) → Used for reporting and analysis. 16. What is a primary key? A primary key uniquely identifies each record in a table. Example: Customer_ID in a customer table. 17. What is a foreign key? A foreign key links one table to another using the primary key of another table. Example: Customer_ID in Orders table linking to Customers table. 18. What is a fact table? Fact table contains measurable business data like sales, revenue, or quantity. 19. What is a dimension table? Dimension table contains descriptive details like customer name, region, product category. 20. What is a data warehouse? A data warehouse is a centralized system that stores large volumes of historical data for analysis and reporting. Double Tap ♥️ For Part-3
Posted Jan 30
✅ Data Analyst Interview Questions with Answers 1. What is data analytics? Data analytics is the process of collecting, cleaning, analyzing, and interpreting data to support business decisions. The goal is to turn raw data into meaningful insights. 2. Difference between data analytics and data science? Data analytics focuses on analyzing historical data to answer what happened and why. Data science focuses on building predictive models to answer what will happen next using machine learning. 3. What problems does a data analyst solve? - Identifying trends and patterns - Explaining business performance - Finding reasons behind growth or decline - Supporting decision-making with data 4. What are the types of data analytics? - Descriptive – What happened - Diagnostic – Why it happened - Predictive – What may happen - Prescriptive – What action to take 5. What tools do data analysts use daily? - Excel for quick analysis - SQL for querying databases - Power BI or Tableau for dashboards - Python (sometimes) for automation - Statistics for interpretation 6. What is a KPI? A KPI (Key Performance Indicator) is a measurable value that shows how well a business or team is achieving its objectives. Example: Monthly revenue, churn rate. 7. Difference between a metric and a KPI? Metric: Any measurable value (page views, clicks). KPI: A critical metric directly linked to business goals (conversion rate, revenue growth). 8. What is descriptive analytics? Descriptive analytics summarizes historical data to understand past performance. Example: Total sales last month, average order value. 9. What is diagnostic analytics? Diagnostic analytics explains why something happened by comparing data and identifying root causes. Example: Sales dropped because website traffic decreased. 10. What does a typical day of a data analyst look like? - Pull data using SQL - Clean data in Excel or Power Query - Build or update dashboards - Analyze trends and metrics - Share insights with stakeholders Double Tap ♥️ For Part-2
Posted Jan 30
7 Misconceptions About Data Analytics (and What’s Actually True):📊🚀 ❌ You need to be a math or statistics genius ✅ Basic math + logical thinking is enough. Most real-world analytics is about understanding data, not complex formulas. ❌ You must learn every tool before applying for jobs ✅ Start with core tools (Excel, SQL, one BI tool). Master fundamentals — tools can be learned on the job. ❌ Data analytics is only about numbers ✅ It’s about storytelling with data — explaining insights clearly to non-technical stakeholders. ❌ You need coding skills like a software developer ✅ Not required. SQL + basic Python/R is enough for most analyst roles. Deep coding is optional, not mandatory. ❌ Analysts just make dashboards all day ✅ Dashboards are just one part. Real work includes data cleaning, business understanding, ad-hoc analysis, and decision support. ❌ You need huge datasets to be a “real” data analyst ✅ Even small datasets can provide powerful insights if the questions are right. ❌ Once you learn analytics, your learning is done ✅ Data analytics evolves constantly — new tools, business problems, and techniques mean continuous learning. 💬Tap ❤️ if you agree