GpuAccelerationConfig.GpuInferencePriority
Stay organized with collections
Save and categorize content based on your preferences.
Relative priorities given by the GPU delegate to different client needs. Ordered
priorities provide better control over desired semantics, where priority(n) is more important
than priority(n+1), therefore, each time inference engine needs to make a decision, it uses
ordered priorities to do so.
For example: GPU_PRIORITY_MAX_PRECISION
at priority(1) would not allow to
decrease precision, but moving it to priority(2) or priority(3) would result in F16
calculation.
GPU_PRIORITY_AUTO
can only be used when higher priorities are fully specified.
Inherited Method Summary
From class java.lang.Enum
From class java.lang.Object
Object
|
clone()
|
boolean |
|
void |
finalize()
|
final Class<?>
|
getClass()
|
int |
hashCode()
|
final void |
notify()
|
final void |
notifyAll()
|
String
|
toString()
|
final void |
wait(long arg0, int arg1)
|
final void |
wait(long arg0)
|
final void |
wait()
|
From interface java.lang.constant.Constable
abstract Optional<? extends ConstantDesc>
|
describeConstable()
|
From interface java.lang.Comparable
Enum Values
public static final
GpuAccelerationConfig.GpuInferencePriority GPU_PRIORITY_AUTO
public static final
GpuAccelerationConfig.GpuInferencePriority
GPU_PRIORITY_MAX_PRECISION
Maximum precision GPU priority.
public static final
GpuAccelerationConfig.GpuInferencePriority
GPU_PRIORITY_MIN_LATENCY
Minimum latency GPU priority.
public static final
GpuAccelerationConfig.GpuInferencePriority
GPU_PRIORITY_MIN_MEMORY_USAGE
Minimum memory usage GPU priority.
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-11-18 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-11-18 UTC."],[[["`GpuAccelerationConfig.GpuInferencePriority` defines relative priorities for GPU delegate inference, with lower priority numbers indicating higher importance."],["Priority order influences inference decisions like precision, with higher priorities overriding lower ones (e.g., `GPU_PRIORITY_MAX_PRECISION` at priority 1 prevents precision reduction)."],["`GPU_PRIORITY_AUTO` is usable only when higher priorities are fully defined."],["The enum provides four priority levels: `GPU_PRIORITY_AUTO`, `GPU_PRIORITY_MAX_PRECISION`, `GPU_PRIORITY_MIN_LATENCY`, and `GPU_PRIORITY_MIN_MEMORY_USAGE`."]]],[]]