Android NDK (C)의 사용자 환경 이해

자체 앱에서 Scene Semantics API를 사용하는 방법을 알아보세요.

Scene Semantics API를 사용하면 개발자가 ML 모델 기반의 실시간 시맨틱 정보를 제공하여 사용자 주변의 장면을 이해할 수 있습니다. 야외 장면의 이미지가 주어지면 API는 하늘, 건물, 나무, 도로, 인도, 차량, 사람 등 유용한 시맨틱 클래스 집합에서 각 픽셀의 라벨을 반환합니다. Scene Semantics API는 픽셀 라벨 외에도 각 픽셀 라벨의 신뢰도 값과 야외 장면에서 특정 라벨의 보급률을 쿼리하는 간편한 방법을 제공합니다.

왼쪽에서 오른쪽으로 입력 이미지의 예, 픽셀 라벨의 시맨틱 이미지, 해당하는 신뢰도 이미지입니다.

입력 이미지, 시맨틱 이미지, 시맨틱 신뢰도 이미지의 예

기본 요건

기본 AR 개념을 이해합니다. 진행하기 전에 ARCore 세션을 구성하는 방법을 알아보세요.

장면 시맨틱 사용 설정

새로운 ARCore 세션에서 사용자의 기기가 Scene Semantics API를 지원하는지 확인합니다. 처리 능력 제약으로 인해 일부 ARCore 호환 기기는 Scene Semantics API를 지원하지 않습니다.

리소스를 저장하기 위해 ARCore에서는 Scene Semantics가 기본적으로 사용 중지되어 있습니다. 앱이 Scene Semantics API를 사용하도록 하려면 시맨틱 모드를 사용 설정하세요.

// Check whether the user's device supports the Scene Semantics API.
int32_t is_scene_semantics_supported = 0;
ArSession_isSemanticModeSupported(ar_session, AR_SEMANTIC_MODE_ENABLED, &is_scene_semantics_supported);

// Configure the session for AR_SEMANTIC_MODEL_ENABLED.
ArConfig* ar_config = NULL;
ArConfig_create(ar_session, &ar_config);
if (is_scene_semantics_supported) {
  ArConfig_setSemanticMode(ar_session, ar_config, AR_SEMANTIC_MODE_ENABLED);
}
CHECK(ArSession_configure(ar_session, ar_config) == AR_SUCCESS);
ArConfig_destroy(ar_config);

시맨틱 이미지 가져오기

Scene Semantics가 사용 설정되면 시맨틱 이미지를 가져올 수 있습니다. 시맨틱 이미지는 AR_IMAGE_FORMAT_Y8 이미지이며, 여기서 각 픽셀은 ArSemanticLabel로 정의된 시맨틱 라벨에 해당합니다.

ArFrame_acquireSemanticImage()를 사용하여 시맨틱 이미지를 가져옵니다.

// Retrieve the semantic image for the current frame, if available.
ArImage* semantic_image = NULL;
if (ArFrame_acquireSemanticImage(ar_session, ar_frame, &semantic_image) != AR_SUCCESS) {
  // No semantic image retrieved for this frame.
  // The output image may be missing for the first couple frames before the model has had a chance to run yet.
  return;
}
// If a semantic image is available, use it here.

출력 시맨틱 이미지는 기기에 따라 세션 시작 후 약 1~3프레임 후에 제공됩니다.

확신 이미지 가져오기

각 픽셀에 대한 라벨을 제공하는 의미론적 이미지 외에, API는 해당 픽셀 신뢰도 값의 신뢰도 이미지도 제공합니다. 신뢰도 이미지는 AR_IMAGE_FORMAT_Y8 이미지로, 각 픽셀은 [0, 255] 범위의 값에 해당하며, 이는 각 픽셀의 시맨틱 라벨과 연결된 확률에 해당합니다.

ArFrame_acquireSemanticConfidenceImage()를 사용하여 시맨틱 신뢰도 이미지를 획득합니다.

// Retrieve the semantic confidence image for the current frame, if available.
ArImage* semantic_confidence_image = NULL;
if (ArFrame_acquireSemanticConfidenceImage(ar_session, ar_frame, &semantic_confidence_image) != AR_SUCCESS) {
  // No semantic confidence image retrieved for this frame.
  // The output image may be missing for the first couple frames before the model has had a chance to run yet.
  return;
}
// If a semantic confidence image is available, use it here.

출력 확신도 이미지는 기기에 따라 세션 시작 후 약 1~3프레임 후에 사용할 수 있습니다.

의미론적 라벨의 픽셀 비율 쿼리

또한 현재 프레임에서 하늘과 같은 특정 클래스에 속하는 픽셀의 비율을 쿼리할 수도 있습니다. 이 쿼리는 시맨틱 이미지를 반환하고 특정 라벨에 대해 픽셀 단위로 검색하는 것보다 효율적입니다. 반환된 분수는 [0.0, 1.0] 범위의 부동 소수점 값입니다.

ArFrame_getSemanticLabelFraction()을 사용하여 지정된 라벨의 분수를 가져옵니다.

// Retrieve the fraction of pixels for the semantic label sky in the current frame.
float out_fraction = 0.0f;
if (ArFrame_getSemanticLabelFraction(ar_session, ar_frame, AR_SEMANTIC_LABEL_SKY, &out_fraction) != AR_SUCCESS) {
  // No fraction of semantic labels was retrieved for this frame.
}