AI in Smart Buildings #4 — Scenarios (10 min read)

Use SIFT (Scale Invariant Feature Transform) to identify key points in faces.
Obtain morphometric vectors of faces: These vectors represent the location of several the fiducial facial landmark points (mostly defining the facial contour and certain facial components). The calculation of these vectors can be done as a linear regression problem (ex., ridge, ElasticNet etc) to adjust a face model with the representative landmarks (contour, eyes, mouth, nose, …).
Perform content based indexing for key features: Techniques for indexing utilize similarity or distance as a measure, and are based on transforming the feature space into a smaller dimension space (using PCA, hierarchical clustering or tSNE) and then use this as the indexing criteria.
Store the content-indexed features in a Biometrics Database: There are existing technologies that support CBIR (Content-based image retrieval) or QBE (query by example) functionalities, e.g., IBM QBIC, Elastic Vision, or VIR Image Engine (Virage).

“The most basic information includes spaces (rooms/hallways) and their connectivity (doors and staircases). Therefore, the simplest option would be a graph model in which the nodes are spaces and edges represent how they are connected. This option can be extended by enriching the information for each node and edge with additional attributes. This can be categorical (such as type/function of the space — restroom, corridor, food court, …) or numerical (volume/surface area, dimensions and sizes, how long it takes to move across the room or cross a connection etc).”

Define concepts — classes and their relationships — in building regulations, as the initial step in creating the regulation ontology. This should include types of rules and regulations; types may be by part of building [roof vs floors vs structure], or function [HVAC vs security vs emergency procedures] etc. Three parts here — a) Use NLP techniques to extract summary level terms — chapter titles, headings, table of contents. b) Use these techniques to identify key concepts and those indicating regulatory constraints [sentences such as “corridors connecting open areas must be at least 5 meters wide”]. c) Identify hierarchical structures, e.g. a building has floors, the floors have spaces, … and their taxonomies [distinction between public areas and restricted areas, …]
Model individual rules to be loaded into the ontology. Four steps here — a) Apply Fundamental NLP techniques — lemmatization, stemming, parts-of-speech (POS) tagging, sentence parsing — to building regulations. b) Model sentence/paragraph structures to be used as input in creating ontologies. c) Extract rules and map them to concept/type of rule. d) This process of creating the ontology shall be led by an expert in knowledge representation, with assistance from NLP components in the definition of the KB.

“The simpler option is a discrete-event simulation (mostly a queue system) that identifies the number of people per node of the graph (spaces in the building). There is a given task the people perform in each node — like walking, shopping, looking at the store displayed items, going to the restroom etc — and the probability or plan to transit to the next node“ + “A solver may be used to chart pathways through the building. Depending on the size and complexity of the building, the number of theoretical pathways can be very large. For a simulation, a solver may be used to determine the relevant options out of the existing ones. Also, a solver may be used to react to situations within a mall — like a large crowd, or a closed corridor, or a broken elevator etc” + “If we choose the Discrete Event Simulation way, we will need to model Behaviour-Design-Intent as pre-processing for Step 7.”

To simply model occupation of areas, it can be solved as a queue system with a Markovian model. This approach limits the control mechanism that governs the agents and it only models density of people per area. In this case, the underlying behaviour model is a random walk.
A more complex definition of agents requires assigning goals and different strategies to pursue them, including at least a rule-based model or a controller based on behaviour trees. For discrete-event models, use software libraries like SimPy or languages like Simula. For agent-based models, counterparts include AgentPy, AgentBase etc. In either situation, since atomic operations are trivial, effort so far has focussed on developing software libraries, languages and apps for this simulation, as opposed to complex mathematical models such as neural networks.

“Crowd behaviour can be modelled as an extension of agent-planning. The easiest approach is to have a basic perception model and a simple internal state representation that allows the agents to switch between plans depending on some fixed rules (e.g. I’m tired so I am going to stop shopping, find a bench, and eat something or leave the mall). A more advanced model would consider the interpretation of the environment. There are computational cognitive models that implement perception and elicitation processes. If the simulation deals with, for instance, emergency simulations such as a fire or people evacuating the building in a hurry, these models better capture these situations. Regardless of simulation approach, a learning model is needed to characterize the type of crowd behaviour”

Input: Raw video sequence from the surveillance cameras & crowd model outcomes
Task: Segment faces in video, correct for perspective and occlusion (and so on), de-scale and extract facial feature vectors, and match to morphometric vectors from the biometrics database. Machine learning models, especially deep neural networks, are famous for not requiring hand-crafting of features and filters in image recognition so the first parts — segmenting, correcting and extracting — could potentially be set up as a data-driven one. The matching process would then utilize a simple distance measure to determine which morphometric vector the extracted facial feature vector is closest to.
Output: Name, Trespasser Yes/No, Degree of Certainty”

Convolutional Neural Network or any state-of-the-art Deep Learning Model for Image Recognition. This would be trained on a large database of raw video of people with scale-free morphometric vectors of faces as labels. The model will need to recognize all faces in video involving crowds, and extract scale-free morphometric vectors.
Simple distance measure to evaluate and identify best possible reference vectors for a given face in a video feed. Uses extracted morphometric vectors from the prior step.

“The core of the system can be a rule-based engine. These are either implemented in logic programming languages (e.g., Prolog or Lisp) or use an intermediate programming format to describe rules. This engine shall be able to propose a sequence of actions to deal with a particular situation based on the current state of the system (occupation of the building). Sometimes, these engines use extended implementations based on partial order planning. This extends the basic rule-based systems to deal with a problem by proposing a sequence of actions (several of which can be performed in alternative orders) to find the objective state of the system”.

Don't hesitate!

Design Thinking for Hero Methods

Machine Learning is King, but of narrow territory. Hero Methods do things that ML cannot. Taken together, not only do they help solve complex problems, they also lay the doorway to AI.

Milton, ON L9T 6T2, Canada
help@kado.ai
+1 416 619 0517

Created with

AI in Smart Buildings #4 — Scenarios

1. Feature Extraction

Method choice for initial build

Technique selection

Alternatives

2. Building Model

Method choice for initial build

Technique selection

Alternatives

3. Regulation Extraction

Method choice for initial build

Technique selection

Alternatives

4. Person Detector

Method choice for initial build

Technique selection

Alternatives

5. Pose Identifier

Method choice for initial build

Technique selection

Alternatives

6. Crowd Model

Method choice for initial build

Technique selection

Alternatives

7. Behaviour Model

Method choice for initial build

Technique selection

Alternatives

8. Surveillance

Method choice for initial build

Technique selection

Alternatives

9. Preventive Security Engine

Method choice for initial build

Technique selection

Alternatives

Design Thinking for Hero Methods

Connect with us

This is how we see the world

Legal Corner

Get in touch