Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • AI Machine-Vision Earns Man Overboard Certification
    • Battery recycling startup Renewable Metals charges up on $12 million Series A
    • The Influencers Normalizing Not Having Sex
    • Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)
    • Today’s NYT Wordle Hints, Answer and Help for April 20 #1766
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Monday, April 20
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Artificial Intelligence»Understanding Convolutional Neural Networks (CNNs) Through Excel
    Artificial Intelligence

    Understanding Convolutional Neural Networks (CNNs) Through Excel

    Editor Times FeaturedBy Editor Times FeaturedNovember 17, 2025No Comments13 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    as a black field. We all know that it learns from information, however the query is how it actually learns.

    On this article, we are going to construct a tiny Convolutional Neural Community (CNN) immediately in Excel to know, step-by-step, how a CNN really works for photographs.

    We’ll open this black field, and watch every step occur proper earlier than our eyes. We’ll perceive all of the calculations which might be the muse of what we name “deep studying”.

    This text is in a collection of articles about implementing machine studying and deep studying algorithms in Excel. And you could find all of the Excel information on this Kofi link.

    1. How Photos are Seen by Machines

    1.1 Two Methods to Detect One thing in an Picture

    Once we attempt to detect an object in an image, like a cat, there are two major methods: the deterministic method and the machine studying method. Let’s see how these two approaches work for this instance of recognizing a cat in an image.

    The deterministic approach means writing guidelines by hand.

    For instance, we are able to say {that a} cat has a spherical face, two triangle ears, a physique, a tail, and so on. So the developer will do all of the work to outline the foundations.

    Then the pc runs all these guidelines, and provides a rating of similarity.

    Deterministic method to detect a cat on an image — picture by creator

    The machine studying method implies that we don’t write guidelines by ourselves.

    As an alternative, we give the pc many examples, footage with cats and footage with out cats. Then it learns by itself what makes a cat a cat.

    Machine studying method to detect a cat on an image — picture by creator (cats are generated by AI)

    That’s the place issues could change into mysterious.

    We normally say that the machine will determine it out by itself, however the actual query is how.

    Actually, we nonetheless have to inform the machines easy methods to create these guidelines. And guidelines must be learnable. So the important thing level is: how can we outline the type of guidelines that can be used?

    To grasp easy methods to outline guidelines, we first have to know what a picture is.

    1.2 Understanding What an Picture Is

    A cat is advanced kind, however we are able to take a easy and clear instance: recognizing handwritten digits from the MNIST dataset.

    First, what’s a picture?

    A digital picture could be seen as a grid of pixels. Every pixel is a quantity that exhibits how brilliant it’s, from 0 for white to 255 for black.

    In Excel, we are able to symbolize this grid with a desk the place every cell corresponds to 1 pixel.

    MNIST Handwritten digits – picture from the MNIST dataset https://en.wikipedia.org/wiki/MNIST_database (CC BY-SA 3.0)

    The unique dimension of the digits is 28 x 28. However to maintain issues easy, we are going to use a ten×10 desk. It’s sufficiently small for fast calculations however nonetheless massive sufficient to point out the final form.

    So we are going to cut back the dimension.

    For instance, the handwritten quantity “1” could be represented by a ten×10 grid as under in Excel.

    Picture is a grid of numbers — picture by creator

    1.3 Earlier than Deep Studying: Basic Machine Studying for Photos

    Earlier than utilizing CNNs or any deep studying methodology, we are able to already acknowledge easy photographs with basic machine studying algorithms reminiscent of logistic regression or choice timber.

    On this method, every pixel turns into one function. For instance, a ten×10 picture has 100 pixels, so there are 100 options as enter.

    The algorithm then learns to affiliate patterns of pixel values with labels reminiscent of “0”, “1”, or “2”.

    Basic ML for picture recognition — picture by creator

    Actually with this easy machine studying method, logistic regression can obtain fairly good outcomes with an accuracy round 90%.

    This exhibits that basic fashions are capable of be taught helpful data from uncooked pixel values.

    Nonetheless, they’ve a significant limitation. They deal with every pixel as an impartial worth, with out contemplating its neighbors. Because of this, they can not perceive spatial relationships with the pixels.

    So intuitively, we all know that the efficiency is not going to be good for advanced photographs. So this methodology is just not scalable.

    Now, in case you already understand how basic machine studying works, you realize that there isn’t a magic. And in reality, you already know what to do: you must enhance the function engineering step, you must remodel the options, to be able to get extra significant data from the pixels.

    2. Constructing a CNN Step by Step in Excel

    2.1 From advanced CNNs to a easy one in Excel

    Once we discuss Convolutional Neural Networks, we frequently see very deep and sophisticated architectures, like VGG-16. Many layers, 1000’s of parameters, and numerous operations, it appears very advanced, and say that it’s not possible to know precisely the way it works.

    VGG16 structure — picture by creator

    The primary concept behind the layers is: detecting patterns step-by-step.

    With the instance of handwritten digits, let’s ask a query: what could possibly be the best attainable CNN structure?

    First, for the hidden layers, earlier than doing all of the layers, let’s cut back the quantity. What number of? Let’s do one. That’s proper: just one.

    As for the filters, what about their dimensions? In actual CNN layers, we normally use 3×3 filters to detect small sample. However let’s start with large ones.

    How large? 10×10!

    Sure, why not?

    This additionally implies that you don’t have to slip the filter throughout the picture. This fashion, we are able to immediately examine the enter picture with the filter and see how properly they match.

    This straightforward case is just not about efficiency, however about readability.
    It is going to present how CNNs detect patterns step-by-step.

    Now, now we have to outline the variety of filters. We’ll say 10, it’s the minimal. Why? As a result of there are 10 digits, so now we have to have a minimal of 10 filters. And we are going to see how they are often discovered within the subsequent part.

    Within the picture under, you will have the diagram of this easiest structure of a CNN neural community:

    The only CNN structure – picture by creator

    2.2 Coaching the Filters (or Designing Them Ourselves)

    In an actual CNN, the filters will not be written by hand. They’re realized throughout coaching.

    The neural community adjusts the values inside every filter to detect the patterns that finest assist to acknowledge the pictures.

    In our easy Excel instance, we is not going to prepare the filters.

    As an alternative, we are going to create them ourselves to know what they symbolize.

    Since we already know the shapes of handwritten digits, we are able to design filters that appear to be every digit.

    For instance, we are able to draw a filter that matches the type of 0, one other for 1, and so forth.

    Another choice is to take the typical picture of all examples for every digit and use that because the filter.

    Every filter will then symbolize the “common form” of a quantity.

    That is the place the frontier between human and machine turns into seen once more. We are able to both let the machine uncover the filters, or we are able to use our personal data to construct them manually.

    That’s proper: machines don’t outline the character of the operations. Machine studying researchers outline them. Machines are solely good to do loops, to seek out the optimum values for these defines guidelines. And in easy instances, people are all the time higher than machines.

    So, if there are solely 10 filters to outline, we all know that we are able to immediately outline the ten digits. So we all know, intuitively, the character of those filters. However there are different choices, after all.

    Now, to outline the numerical values of those filters, we are able to immediately use our data. And we can also use the coaching dataset.

    Beneath you may see the ten filters created by averaging all the pictures of every handwritten digit. Each exhibits the everyday sample that defines a quantity.

    Common values as filters — picture by creator

    2.3 How a CNN Detects Patterns

    Now that now we have the filters, now we have to check the enter picture to those filters.

    The central operation in a CNN known as cross-correlation. It’s the key mechanism that permits the pc to match patterns in a picture.

    It really works in two easy steps:

    1. Multiply values/dot product: we take every pixel within the enter picture, and we are going to multiply it by the pixel in the identical place of the filter. Because of this the filter “appears to be like” at every pixel of the picture and measures how related it’s to the sample saved within the filter. Sure, if the 2 values are massive, then the result’s massive.
    2. Add outcomes/sum: The merchandise of those multiplications are then added collectively to provide a single quantity. This quantity expresses how strongly the enter picture matches the filter.
    Instance of Cross Correlation for one image – picture by creator

    In our simplified structure, the filter has the identical measurement because the enter picture (10×10).

    Due to this, the filter doesn’t want to maneuver throughout the picture.
    As an alternative, the cross-correlation is utilized as soon as, evaluating the entire picture with the filter immediately.

    This quantity represents how properly the picture matches the sample contained in the filter.

    If the filter appears to be like like the typical form of a handwritten “5”, a excessive worth implies that the picture might be a “5”.

    By repeating this operation with all filters, one per digit, we are able to see which sample provides the best match.

    2.4 Constructing a Easy CNN in Excel

    We are able to now create a small CNN from finish to finish to see how the complete course of works in observe.

    1. Enter: A ten×10 matrix represents the picture to categorise.
    2. Filters: We outline ten filters of measurement 10×10, every one representing the typical picture of a handwritten digit from 0 to 9. These filters act as sample detectors for every quantity.
    3. Cross correlation: Every filter is utilized to the enter picture, producing a single rating that measures how properly the picture matches that filter’s sample.
    4. Resolution: The filter with the best rating provides the expected digit. In deep studying frameworks, this step is usually dealt with by a Softmax operate, which converts all scores into chances.
      In our easy Excel model, taking the most rating is sufficient to decide which digit the picture almost certainly represents.
    Each 10×10 filter represents the average shape of a handwritten digit (0–9).
The input image is compared with all filters using cross-correlation.
The filter that produces the highest score — after normalization with Softmax — corresponds to the detected digit.
    Cross-correlation of the enter digit with ten common digit filters. The best rating, normalized by Softmax, identifies the enter as “6.” – picture by creator

    2.5 Convolution or Cross Correlation?

    At this level, you may surprise why we name it a Convolutional Neural Community when the operation we described is definitely cross-correlation.

    The distinction is refined however easy:

    • Convolution means flipping the filter each horizontally and vertically earlier than sliding it over the picture.
    • Cross-correlation means making use of the filter immediately, with out flipping.

    For extra data, you may learn this text:

    For some historic purpose, the time period Convolution stayed, whereas the operation that’s really completed in a CNN is cross-correlation.

    As you may see, in most deep-learning frameworks, reminiscent of PyTorch or TensorFlow, really use cross-correlation when performing “convolutions”.

    Cross correlation and convolution — picture by creator

    Briefly:

    CNNs are “convolutional” in title, however “cross-correlational” in observe.

    3. Constructing Extra Advanced Architectures

    3.1 Small filters to detect extra detailed patterns

    Within the earlier instance, we used a single 10×10 filter to check the entire picture with one sample.

    This was sufficient to know the precept of cross-correlation and the way a CNN detects similarity between a picture and a filter.

    Now we are able to take one step additional.

    As an alternative of 1 international filter, we are going to use a number of smaller filters, every of measurement 5×5. These filters will have a look at smaller areas of the picture, detecting native particulars as a substitute of your complete form.

    Let’s take an instance with 4 5×5 filters utilized to a handwritten digit.

    The enter picture could be lower into 4 smaller components of 5×5 pixels for every one.

    We nonetheless can use the typical worth of all of the digits to start with. So every filter will give 4 values, as a substitute of 1.

    Smaller filters in CNN for digits recognition – picture by creator

    On the finish, we are able to apply a Softmax operate to get the ultimate prediction.

    However on this easy case, it is usually attainable simply to sum all of the values.

    3.2 What if the digit is just not within the middle of the picture

    In my earlier examples, I examine the filters to fastened areas of the picture. And one intuitive query that we are able to ask is what if the thing is just not centered. Sure, it may be at any place on a picture.

    The answer is sadly very primary: you slide the filter throughout the picture.

    Let’s take a easy instance once more: the dimension of the enter picture is 10×14. The peak is just not modified, and the width is 14.

    So the filter continues to be 10 x 10, and it’ll slide horizontally throughout the picture. Then, we are going to get 5 cross-correlation.

    We have no idea the place the picture is, however it’s not an issue as a result of we are able to simply get the max worth of the 5 the-cross correlations.

    That is what we name max pooling layer.

    Max pooling in a easy CNN – Picture by creator

    3.3 Different Operations Utilized in CNNs

    We attempt to clarify, why every part is helpful in a CNN.

    An important part is the cross-correlation between the enter and the filters. And we additionally clarify that small filters could be helpful, and the way max pooling handles objects that may be anyplace in a picture.

    There are additionally different steps generally utilized in CNNs, reminiscent of utilizing a number of layers in a row or making use of non-linear activation capabilities.

    These steps make the mannequin extra versatile, extra sturdy, and capable of be taught richer patterns.

    Why are they helpful precisely?

    I’ll go away this query to you as an train.

    Now that you simply perceive the core concept, attempt to consider how every of those steps helps a CNN go additional, and you may attempt to consider some concrete instance in Excel.

    Conclusion

    Simulating a CNN in Excel is a enjoyable and sensible strategy to see how machines acknowledge photographs.

    By working with small matrices and easy filters, we are able to perceive the primary steps of a CNN.

    I hope this text gave you some meals for considered what deep studying actually is. The distinction between machine studying and deep studying is just not solely about how deep the mannequin is, however about the way it works with representations of photographs and information.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.

    April 19, 2026

    Proxy-Pointer RAG: Structure Meets Scale at 100% Accuracy with Smarter Retrieval

    April 19, 2026

    Dreaming in Cubes | Towards Data Science

    April 19, 2026

    AI Agents Need Their Own Desk, and Git Worktrees Give Them One

    April 18, 2026

    Your RAG System Retrieves the Right Data — But Still Produces Wrong Answers. Here’s Why (and How to Fix It).

    April 18, 2026

    Europe Warns of a Next-Gen Cyber Threat

    April 18, 2026

    Comments are closed.

    Editors Picks

    AI Machine-Vision Earns Man Overboard Certification

    April 20, 2026

    Battery recycling startup Renewable Metals charges up on $12 million Series A

    April 20, 2026

    The Influencers Normalizing Not Having Sex

    April 20, 2026

    Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    US Congressman raises concerns about Kalshi/CNN partnership in public letter

    December 19, 2025

    Modular command center moves app control off your screen

    December 1, 2025

    A million dollar Honda motorcycle? Maybe two?

    January 17, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.