Direct API requests to `extraction-api`

You can use HTTP API to extract data directly from the extraction-api component.

Note

extraction-api is the component that sf-api relies on for object detection and feature vector and attribute extraction. It is stateless and operates purely on request-response basis.

Tip

Normalized images received from extraction-api are qualified for posting to sf-api and vice versa.

In this section:

Structure
- API Requests structure
- API Response Structure
API V2
API V1

Structure 

API Requests structure 

The extraction-api component accepts requests to http://<extraction-api_ip>:18666/.

There are an API v1 and v2 (multi-object).

Important

Differences:

The API v1 is selected automatically, if the /v2 path is not specified in the request URL.
The API v1 does not support variants of the attribute extractors. Read more here.
The API v2 defines the object type and returns response in appropriate fields. Also, it has new unified attribute names.

There are 2 ways to format the request body:

application/json: the request body contains only JSON.
multipart/form-data: the request body contains a JSON part with the request itself, other body parts are used for image transfer.

The JSON part of the request body contains a set of requests:

{
    "requests": [request1, request2, .., requestN]
    "include_timings": true|false // include face processing timing in response, false by default
    "response_format": "msgpack" // return response in msgpack format
}

API Response Structure 

A typical response from the extraction-api component contains a set of responses to the requests wrapped into the main API request:

{
    "response": [response1, response2, .., responseN]
}

API V2 

API Request Format 

Each request in the set applies to a specific image or region in the image and accepts the following parameters:

"image": an uploaded image (use multipart:part to refer to a relevant request body part), or a publicly accessible image URL (http:, https:).
"roi": a region of interest in the image. If the region is not specified, the entire image is processed.
"detector": an object detector to apply to the image (face, body, car and etc. or prenormalized or original). The prenormalized mode accepts normalized object images and omits detecting objects.
"object_type": a required parameter, if the value of the detector is original. For example, to extract face_liveness from the original image the value must be face. For the models that do not define an object type, e.g., frameattr/frameattr.crowdcount.v0, the value must be none.
"bbox": object bbox. It is used for extraction from original, extraction will be from the image with that value of bbox.
"need_normalized": returns a normalized object image encoded in base64. The normalized image can then be posted again to the extraction-api component as prenormalized.
"auto_rotate": if true, auto-rotates an original image to 4 different orientations and returns objects detected in each orientation.
"quality_estimator": if false, detection_score returns from the detector without face_quality attribute extract.
"attributes": array of strings in the format ["face_gender", "face_age", "face_emotions"], enables recognition of the object features passed in the array. Attribute name contains object type as prefix (face_, body_ and etc.). If a variant is used for an attribute extractor, specify it after a pipe, e.g., ["face_age|v3"]. If you need a default extractor, it’s not necessary to specify it as a variant.

{
    "image": "http://static.findface.pro/sample.jpg",
    "roi": {"left": 0, "right": 1000, "top": 0, "bottom": 1000},
    "detector": "face",
    "need_normalized": true,
    "auto_rotate": true,
    "attributes": ["face_emben", "face_gender", "face_age", "face_age|v3", "face_emotions", "face_beard", "face_glasses3"]
}

API Response Format 

Each response in the set contains the following JSON data:

"objects": a structure with sets of detected objects in the provided image or region of interest.
"error": an error occurred during processing (if any). The error body includes the error code which can be interpreted automatically ("code") and a human-readable description ("desc").
"timings": processing timings if "include_timings": true.

{
    "objects": {
        "face": [...] // detected face objects
        "car": [...], // detected car objects
        "head": [...] // detected head objects
        "body": [...] // detected body objects
    },
    "timings": ... // timings if requested
}

Each object in the set is provided with the following data:

"group_id": detection group identifier. All bboxes from one detect will have the same value. For example, used for N-in-1 detectors for grouping head, body, face from 3-in-1 detector.
"bbox": coordinates of a bounding box with the object.
"detection_score": either the object detection accuracy, or the object quality score. Upright objects in frontal position are considered the best quality.
"rotation_angle": if "auto_rotate":true, returns the angle at which an object was detected.
"attributes": attributes with extracted results. As a key uses a full attribute name with a variant, specified after a pipe (if any) (face_age, face_age|v3, face_gender), as a value — objects with the following data:
- "extractor": extractor name.
- "model": name of the extractor model.
- "result": extraction result, may be different types for different extractors.
"normalized": a normalized face image encoded in base64, if requested.
"timings": face processing timings, if requested.

{
    "group_id": "28c97d15",
    "bbox": { "left": 1, "right": 2, "top": 3, "bottom": 4},
    "detection_score": 0.99,
    "normalized": "...",
    "attributes": {
      "face_age": {
          "extractor": "face_age",
          "model": "age.v2",
          "result": 25
      },
      "face_age|v3": {
          "extractor": "face_age",
          "model": "age.v3",
          "result": 25
      },
      "face_beard": {
          "extractor": "face_beard",
          "model": "beard.v0",
          "result": [
              { "confidence": 0.015328666, "name": "beard" }
          ]
      },
      "face_emotions": {
          "extractor": "face_emotions",
          "model": "emotions.v1",
          "result": [
              { "confidence": 0.99959123, "name": "neutral" },
              { "confidence": 0.00039093022, "name": "sad" },
              { "confidence": 8.647058e-06, "name": "happy" },
              { "confidence": 7.994732e-06, "name": "surprise" },
              { "confidence": 6.495376e-07, "name": "disgust" },
              { "confidence": 6.063106e-07, "name": "angry" },
              { "confidence": 7.077886e-10, "name": "fear" }
          ]
      },
      ...
    }
    "timings": ...
}

Examples 

Request #1

curl -X POST -F sample=@sample.jpg -F 'request={"requests":[{"image":"multipart:sample", "detector":"face", "attributes": ["face_age", "face_gender", "face_emben"]}]}' http://127.0.0.1:18666/v2 | jq .

Response

{
  "responses": [
    {
      "faces": null,
      "objects": {
        "face": [
          {
            "group_id": "b781670d",
            "bbox": {
              "left": 168,
              "top": 338,
              "right": 812,
              "bottom": 1234
            },
            "detection_score": 0.7689582,
            "rotation_angle": 0,
            "attributes": {
              "face_age": {
                "extractor": "face_age",
                "model": "age.v2",
                "result": 47
              },
              "face_emben": {
                "extractor": "face_emben",
                "model": "kiwi_320",
                "result": "..."
              },
              "face_gender": {
                "extractor": "face_gender",
                "model": "gender.v2",
                "result": [
                  {
                    "confidence": 1,
                    "name": "male"
                  },
                  {
                    "confidence": 5.503795e-08,
                    "name": "female"
                  }
                ]
              }
            }
          }
        ]
      },
      "orientation": 1,
      "detector": "face_jasmine"
    }
  ]
}

Important

If the requested attribute is not found in the configuration file or isn’t loaded, or an object attribute does not match detect object type, this attribute will be ignored in the returned response.

Important

If need_normalized: true is specified in the request, normalization from "objects:object:base_normalization" config field will be used. If there is no base normalization in config, the default normalization will be used.

Request #2 A simple request with a 3-in-1 headbodyface detector

curl -s -X POST -F sample=@sample_3in1.jpg -F 'request={"requests":[{"image":"multipart:sample", "detector":"headbodyface", "attributes": ["face_emben", "body_emben", "head_motohelmet"]}]}' http://127.0.0.1:18666/v2 | jq

Response

{
  "responses": [
    {
      "faces": null,
      "objects": {
        "face": [
          {
            "group_id": "43c199aa",
            "bbox": {
              "left": 616,
              "top": 232,
              "right": 645,
              "bottom": 266
            },
            "detection_score": 0.67829776,
            "rotation_angle": 0,
            "attributes": {
              "face_emben": {
                "extractor": "face_emben",
                "model": "kiwi_320",
                "result": "..."
              }
            }
          }
        ],
        "head": [
          {
            "group_id": "43c199aa",
            "bbox": {
              "left": 615,
              "top": 225,
              "right": 652,
              "bottom": 270
            },
            "detection_score": 0.94091797,
            "rotation_angle": 0,
            "attributes": {
              "head_motohelmet": {
                "extractor": "head_motohelmet",
                "model": "headattr.motohelmet.v1",
                "result": 0.109558105
              }
            }
          }
        ],
        "body": [
          {
            "group_id": "43c199aa",
            "bbox": {
              "left": 544,
              "top": 220,
              "right": 691,
              "bottom": 468
            },
            "detection_score": 0.7998271,
            "rotation_angle": 0,
            "attributes": {
              "body_emben": {
                "extractor": "body_emben",
                "model": "andariel",
                "result": "..."
              }
            }
          }
        ]
      },
      "orientation": 1,
      "detector": "headbodyface"
    }
  ]
}

Warning

headbodyface detector must be enabled in the extraction-api.yaml configuration file.

detectors:
    max_batch_size: 1
    instances: 1
    models:
    headbodyface:
      aliases:
      - headbodyface
      model: detector/headbodyface.gpu.fnk
      options:
        min_object_size: 32
        resolutions: [2048x2048]

Request #3 Request with "frameattr" extraction

curl -s -X POST -F sample=@/home/crowd.jpg -F 'request={"requests":[{"image":"multipart:sample", "detector":"original", "attributes":["crowd_count"], "object_type": "none"}]}' http://127.0.0.1:18666/v2/ | jq

Response

{
  "responses": [
    {
      "faces": null,
      "objects": {
        "none": [
          {
            "group_id": "",
            "bbox": {
              "left": 0,
              "top": 0,
              "right": 1276,
              "bottom": 608
            },
            "detection_score": 1,
            "rotation_angle": 0,
            "attributes": {
              "crowd_count": {
                "extractor": "crowd_count",
                "model": "frameattr.crowdcount.v0",
                "result": {
                  "count": 690.92346,
                  "heatmap_height": 76,
                  "heatmap_image": "...",
                  "heatmap_image_multiplier": 0.9641899,
                  "heatmap_width": 159
                }
              }
            }
          }
        ]
      },
      "orientation": 1,
      "detector": "original"
    }
  ]
}

Request #4 Request with liveness extraction from original

curl -s -X POST -F sample=@/home/sample2.jpg -F 'request={"requests":[{"image":"multipart:sample", "detector":"original", "attributes":["face_liveness", "face_emben"], "bbox": {"left": 10, "top": 10, "right": 444, "bottom": 444}, "object_type": "face"}]}' http://127.0.0.1:18666/v2/ | jq

Response

{
  "responses": [
    {
      "faces": null,
      "objects": {
        "face": [
          {
            "group_id": "",
            "bbox": {
              "left": 10,
              "top": 10,
              "right": 445,
              "bottom": 445
            },
            "detection_score": 1,
            "rotation_angle": 0,
            "attributes": {
              "face_emben": {
                "extractor": "face_emben",
                "model": "kiwi_160",
                "result": "..."
              },
              "face_liveness": {
                "extractor": "face_liveness",
                "model": "liveness.pvn.v0",
                "result": 0.77615935
              }
            }
          }
        ]
      },
      "orientation": 1,
      "detector": "original"
    }
  ]
}

Request #5 Request to an extractor with a specified variant

curl -X POST -F sample=@sample.jpeg -F 'request={"requests":[{"image":"multipart:sample", "detector":"face", "attributes": ["face_liveness", "face_liveness|default", "face_liveness|web", "face_liveness|pvn"]}]}' http://127.0.0.1:18666/v2 | jq

Response

{
  "responses": [
    {
      "faces": null,
      "objects": {
        "face": [
          {
            "group_id": "10e1cede",
            "bbox": {
              "left": 103,
              "top": 77,
              "right": 195,
              "bottom": 200
            },
            "detection_score": 0.94848424,
            "rotation_angle": 0,
            "attributes": {
              "face_liveness": {
                "extractor": "face_liveness",
                "model": "liveness.pacs.v2",
                "result": 0.47667715
              },
              "face_liveness|default": {
                "extractor": "face_liveness",
                "model": "liveness.pacs.v2",
                "result": 0.47667715
              },
              "face_liveness|pvn": {
                "extractor": "face_liveness",
                "model": "liveness.pvn.v2",
                "result": 0.0770323
              },
              "face_liveness|web": {
                "extractor": "face_liveness",
                "model": "liveness.web.v0",
                "result": 0.4316247
              }
            }
          }
        ]
      },
      "orientation": 1,
      "detector": "jasmine"
    }
  ]
}

A specific extractor is addressed by adding |variant to the extractor type (e.g., face_liveness|default, face_liveness|pvn, etc.). Attributes are returned as specified in the request. Thus, although requests with the face_liveness and face_liveness|default attributes appeal to the same extractor, they are returned as two separate attributes in the response.

Method GET /v2/models-info 

This method returns the information about enabled detectors, normalizers, extractors and objects.

Request

curl -s  http://127.0.0.1:18666/v2/models-info | jq

Response

{
  "detectors": {
    "body": {
      "object_types": [
        "body"
      ]
    },
    "car": {
      "object_types": [
        "car"
      ]
    },
    "gustav_body": {
      "object_types": [
        "body"
      ]
    },
    "gustav_car": {
      "object_types": [
        "car"
      ]
    },
    "headbodyface": {
      "object_types": [
        "head",
        "body",
        "face"
      ]
    },
    "license_plate": {
      "object_types": [
        "license_plate"
      ]
    },
    "license_plate_gustav_accurate": {
      "object_types": [
        "license_plate"
      ]
    },
    "shiloette": {
      "object_types": [
        "body"
      ]
    }
  },
  "normalizers": {
    "carlicplate": {
      "normalization_type": "carlicplate"
    },
    "cropbbox": {
      "normalization_type": "cropbbox"
    },
    "multicrop_full_crop2x": {
      "normalization_type": "multicrop_full_crop2x"
    },
    "norm200": {
      "normalization_type": "norm200"
    }
  },
  "extractors": {
    "car_color": {
      "normalization": "crop1x",
      "model_name": "carattr_color.v0"
    },
    "car_quality": {
      "normalization": "cropbbox",
      "model_name": "carattr.quality.v0"
    },
    "face_emben": {
      "normalization": "norm200",
      "model_name": "kiwi_160"
    },
    "face_liveness": {
      "normalization": "multicrop_full_crop2x",
      "model_name": "faceattr.liveness_web.v1"
    },
    "face_liveness|mobile": {
      "normalization": "multicrop_full_center",
      "model_name": "faceattr.liveness_mobile.hart"
    },
    "face_quality": {
      "normalization": "crop1x",
      "model_name": "quality_fast.v1"
    },
    "license_plate_quality": {
      "normalization": "cropbbox",
      "model_name": "carlicplateattr.quality.v0"
    }
  },
  "objects": {
    "car": {
      "quality_attribute": "car_quality",
      "base_normalizer": "cropbbox"
    },
    "face": {
      "quality_attribute": "face_quality",
      "base_normalizer": "crop2x"
    },
    "license_plate": {
      "quality_attribute": "license_plate_quality",
      "base_normalizer": "carlicplate"
    }
  }
}

API V1 

API Request Format 

Each request in the set applies to a specific image or region in the image and accepts the following parameters:

Important

To enable recognition of face features, you can use either the new (preferred) or old API parameters. The old API allows you to recognize gender, age, and emotions, while the new API provides recognition of gender, age, emotions, beard, and glasses. Each face feature (gender, age, emotions, beard, or glasses) must be mentioned only once in a request, either in the new or old API format.

"image": an uploaded image (use multipart:part to refer to a relevant request body part), or a publicly accessible image URL (http:, https:).
"roi": a region of interest in the image. If the region is not specified, the entire image is processed.
"detector": a face detector to apply to the image (legacy, nnd or prenormalized). The prenormalized mode accepts normalized face images and omits detecting faces. Use nnd if you need to estimate the face quality ("quality_estimator": true).
"need_facen": if true, the request returns a facen in the response.
"need_gender": returns gender (old API).
"need_emotions": returns emotions (old API).
"need_age": returns age (old API).
"need_normalized": returns a normalized face image encoded in base64. The normalized image can then be posted again to the extraction-api component as “prenormalized”.
"auto_rotate": if true, auto-rotates an original image to 4 different orientations and returns faces detected in each orientation. Works only if "detector": "nnd" and "quality_estimator": true.
"attributes": array of strings in the format ["gender", "age", "emotions", "beard", "glasses3"], enables recognition of the face features passed in the array (new API).

{
    "image": "http://static.findface.pro/sample.jpg",
    "roi": {"left": 0, "right": 1000, "top": 0, "bottom": 1000},
    "detector": "nnd",
    "need_facen": true,
    "need_gender": true,
    "need_emotions": true,
    "need_age": true,
    "need_normalized": true,
    "auto_rotate": true
}

API Response Format 

Each response in the set contains the following JSON data:

"faces": a set of faces detected in the provided image or region of interest.
"error": an error occurred during processing (if any). The error body includes the error code which can be interpreted automatically ("code") and a human-readable description ("desc").
"facen_model": face extraction model if "need_facen": true.
"timings": processing timings if "include_timings": true.

{
    "faces": [face1, face2, .., faceN],
    "error": {
        "code": "IMAGE_DECODING_FAILED",
        "desc": "Failed to decode: reason"
    }
    "facen_model": "elderberry_576",
    "timings": ...

}

Each face in the set is provided with the following data:

"bbox": coordinates of a bounding box with the face.
"detection_score": either the face detection accuracy, or the face quality score (depending on whether quality_estimator is false or true in the extraction-api.yaml). Upright faces in frontal position are considered the best quality.
"facen": face feature vector.
"gender": gender information (MALE or FEMALE) with recognition accuracy if requested (old API).
"age": age estimate if requested (old API).
"emotions": all available emotions in descending order of probability if requested (old API).
"attributes": gender (male or female), age (number of years), emotions (predominant emotion), beard (beard or none), glasses (sun, eye, or none), along with algorithm confidence in the result if requested (new API).
"normalized": a normalized face image encoded in base64, if requested.
"timings": face processing timings, if requested.

{
 "bbox": { "left": 1, "right": 2, "top": 3, "bottom": 4},
 "detection_score": 0.99,
 "facen": "...",
 "gender": {
     "gender": "MALE",
     "score": "1.123"
 },
 "age": 23.59,
 "emotions": [
     { "emotion": "neutral", "score": 0.95 },
     { "emotion": "angry", "score": 0.55 },
     ...
 ],
 "normalized": "...",
 "attributes": {
   "age": {
       "attribute": "age",
       "model": "age.v1",
       "result": 25
   },
   "beard": {
       "attribute": "beard",
       "model": "beard.v0",
       "result": [
           { "confidence": 0.015328666, "name": "beard" }
       ]
   },
   "emotions": {
       "attribute": "emotions",
       "model": "emotions.v1",
       "result": [
           { "confidence": 0.99959123, "name": "neutral" },
           { "confidence": 0.00039093022, "name": "sad" },
           { "confidence": 8.647058e-06, "name": "happy" },
           { "confidence": 7.994732e-06, "name": "surprise" },
           { "confidence": 6.495376e-07, "name": "disgust" },
           { "confidence": 6.063106e-07, "name": "angry" },
           { "confidence": 7.077886e-10, "name": "fear"               }
       ]
   },
   "gender": {
       "attribute": "gender",
       "model": "gender.v2",
       "result": [
           { "confidence": 0.999894, "name": "female" },
           { "confidence": 0.00010597264, "name": "male" }
       ]
   },
   "glasses3": {
       "attribute": "glasses3",
       "model": "glasses3.v0",
       "result": [
           { "confidence": 0.9995815, "name": "none" },
           { "confidence": 0.0003348241, "name": "eye" },
           { "confidence": 8.363914e-05, "name": "sun" }
       ]
   }
 }
 "timings": ...
}

Examples 

Request #1

curl -X POST -F sample=@sample.jpg -F 'request={"requests":[{"image":"multipart:sample","detector":"nnd", "need_gender":true, "need_normalized": true, "need_facen": true}]}' http://127.0.0.1:18666/| jq

Response

{
  "responses": [
    {
      "faces": [
        {
          "bbox": {
            "left": 595,
            "top": 127,
            "right": 812,
            "bottom": 344
          },
          "detection_score": -0.0012599,
          "facen": "qErDPTE...vd4oMr0=",
          "gender": {
            "gender": "FEMALE",
            "score": -2.6415858
          },
          "normalized": "iVBORw0KGgoAAAANSUhE...79CIbv"
        }
      ]
    }
  ]
}

Request #2

curl -X POST  -F 'request={"requests": [{"need_age": true, "need_gender": true, "detector": "nnd", "roi": {"left": -2975, "top": -635, "right": 4060, "bottom": 1720}, "image": "https://static.findface.pro/sample.jpg", "need_emotions": true}]}' http://127.0.0.1:18666/ |jq

Response

{
  "responses": [
    {
      "faces": [
        {
          "bbox": {
            "left": 595,
            "top": 127,
            "right": 812,
            "bottom": 344
          },
          "detection_score": 0.9999999,
          "gender": {
            "gender": "FEMALE",
            "score": -2.6415858
          },
          "age": 26.048346,
          "emotions": [
            {
              "emotion": "neutral",
              "score": 0.90854686
            },
            {
              "emotion": "sad",
              "score": 0.051211596
            },
            {
              "emotion": "happy",
              "score": 0.045291856
            },
            {
              "emotion": "surprise",
              "score": -0.024765536
            },
            {
              "emotion": "fear",
              "score": -0.11788454
            },
            {
              "emotion": "angry",
              "score": -0.1723868
            },
            {
              "emotion": "disgust",
              "score": -0.35445923
            }
          ]
        }
      ]
    }
  ]
}

Request #3. Auto-rotation

curl -s -F 'sample=@/path/to/your/photo.png' -F 'request={"requests":[{"image":"multipart:sample","detector":"nnd", "auto_rotate": true, "need_normalized": true }]}' http://192.168.113.79:18666/

Response

{
 "responses": [
   {
     "faces": [
       {
         "bbox": {
           "left": 96,
           "top": 99,
           "right": 196,
           "bottom": 198
         },
         "detection_score": -0.00019264,
         "normalized": "iVBORw0KGgoAAAANSUhE....quWKAAC"
        },
       {
         "bbox": {
           "left": 205,
           "top": 91,
           "right": 336,
           "bottom": 223
         },
         "detection_score": -0.00041600747,
         "normalized": "iVBORw0KGgoAAAANSUhEUgAA....AByquWKAACAAElEQVR4nKy96XYbybIdnF"
       }
     ]
   }
 ]
}

Request #4. New API usage (attributes: “beard”, “emotions”, “age”, “gender”, “glasses3”, “face”)

curl -s -F photo=@sample.jpg -Frequest='{"requests": [{"image":"multipart:photo", "detector": "nnd", "attributes": ["beard", "emotions", "age", "gender", "glasses3", "face"]}]}' http://127.0.0.1:18666 | jq

Response

{
  "responses": [
    {
      "faces": [
        {
          "bbox": {
            "left": 595,
            "top": 127,
            "right": 812,
            "bottom": 344
          },
          "detection_score": -0.00067401276,
          "rotation_angle": 0,
          "attributes": {
            "age": {
              "attribute": "age",
              "model": "age.v1",
              "result": 25
            },
            "beard": {
              "attribute": "beard",
              "model": "beard.v0",
              "result": [
                {
                  "confidence": 0.015324414,
                  "name": "beard"
                }
              ]
            },
            "emotions": {
              "attribute": "emotions",
              "model": "emotions.v1",
              "result": [
                {
                  "confidence": 0.99958,
                  "name": "neutral"
                },
                {
                  "confidence": 0.0004020365,
                  "name": "sad"
                },
                {
                  "confidence": 8.603454e-06,
                  "name": "happy"
                },
                {
                  "confidence": 8.076766e-06,
                  "name": "surprise"
                },
                {
                  "confidence": 6.6535216e-07,
                  "name": "disgust"
                },
                {
                  "confidence": 6.1434775e-07,
                  "name": "angry"
                },
                {
                  "confidence": 7.3372125e-10,
                  "name": "fear"
                }
              ]
            },
            "face": {
              "attribute": "face",
              "model": "elderberry_576",
              "result": "KjiHu6cWh70ppqa9l"
            },
            "gender": {
              "attribute": "gender",
              "model": "gender.v2",
              "result": [
                {
                  "confidence": 0.9998938,
                  "name": "female"
                },
                {
                  "confidence": 0.000106243206,
                  "name": "male"
                }
              ]
            },
            "glasses3": {
              "attribute": "glasses3",
              "model": "glasses3.v0",
              "result": [
                {
                  "confidence": 0.99958307,
                  "name": "none"
                },
                {
                  "confidence": 0.00033243417,
                  "name": "eye"
                },
                {
                  "confidence": 8.4465064e-05,
                  "name": "sun"
                }
              ]
            }
          }
        }
      ],
      "orientation": 1
    }
  ]
}

Direct API requests to extraction-api

Direct API requests to `extraction-api`