# Description This simplifies our OTEL setup by: * Getting rid of some deprecated things. * Using `autoexport` and letting things get configured by the `OTEL_` environment variables. * Removing all the unnecessary config options. ## Checklist Please put an x inside each checkbox to indicate that you've read and followed it: `[ ]` -> `[x]` If this is a documentation change, only the first checkbox must be filled (you can delete the others if you want). - [x] I/we have read the [GoToSocial contribution guidelines](https://codeberg.org/superseriousbusiness/gotosocial/src/branch/main/CONTRIBUTING.md). - [x] I/we have discussed the proposed changes already, either in an issue on the repository, or in the Matrix chat. - [x] I/we have not leveraged AI to create the proposed changes. - [x] I/we have performed a self-review of added code. - [x] I/we have written code that is legible and maintainable by others. - [ ] I/we have commented the added code, particularly in hard-to-understand areas. - [x] I/we have made any necessary changes to documentation. - [ ] I/we have added tests that cover new code. - [x] I/we have run tests and they pass locally with the changes. - [x] I/we have run `go fmt ./...` and `golangci-lint run`. Reviewed-on: https://codeberg.org/superseriousbusiness/gotosocial/pulls/4110 Reviewed-by: tobi <kipvandenbos@noreply.codeberg.org> Co-authored-by: Daenney <daenney@noreply.codeberg.org> Co-committed-by: Daenney <daenney@noreply.codeberg.org>
26 KiB
Logs API
Abstract
go.opentelemetry.io/otel/log provides
Logs API.
The prototype was created in #4725.
Background
The key challenge is to create a performant API compliant with the specification with an intuitive and user friendly design. Performance is seen as one of the most important characteristics of logging libraries in Go.
Design
This proposed design aims to:
- be specification compliant,
- be similar to Trace and Metrics API,
- take advantage of both OpenTelemetry and
slogexperience to achieve acceptable performance.
Module structure
The API is published as a single go.opentelemetry.io/otel/log Go module.
The package structure is similar to Trace API and Metrics API. The Go module consists of the following packages:
go.opentelemetry.io/otel/loggo.opentelemetry.io/otel/log/embeddedgo.opentelemetry.io/otel/log/logtestgo.opentelemetry.io/otel/log/noop
Rejected alternative:
LoggerProvider
The LoggerProvider abstraction
is defined as LoggerProvider interface in provider.go.
The specification may add new operations to LoggerProvider.
The interface may have methods added without a package major version bump.
This embeds embedded.LoggerProvider to help inform an API implementation
author about this non-standard API evolution.
This approach is already used in Trace API and Metrics API.
LoggerProvider.Logger
The Logger method implements the Get a Logger operation.
The required name parameter is accepted as a string method argument.
The LoggerOption options are defined to support optional parameters.
Implementation requirements:
-
The specification requires the method to be safe to be called concurrently.
-
The method should use some default name if the passed name is empty in order to meet the specification's SDK requirement to return a working logger when an invalid name is passed as well as to resemble the behavior of getting tracers and meters.
Logger can be extended by adding new LoggerOption options
and adding new exported fields to the LoggerConfig struct.
This design is already used in Trace API for getting tracers
and in Metrics API for getting meters.
Rejected alternative:
Logger
The Logger abstraction
is defined as Logger interface in logger.go.
The specification may add new operations to Logger.
The interface may have methods added without a package major version bump.
This embeds embedded.Logger to help inform an API implementation
author about this non-standard API evolution.
This approach is already used in Trace API and Metrics API.
Logger.Emit
The Emit method implements the Emit a LogRecord operation.
Context associated with the LogRecord
is accepted as a context.Context method argument.
Calls to Emit are supposed to be on the hot path.
Therefore, in order to reduce the number of heap allocations,
the LogRecord abstraction,
is defined as Record struct in record.go.
Timestamp
is accessed using following methods:
func (r *Record) Timestamp() time.Time
func (r *Record) SetTimestamp(t time.Time)
ObservedTimestamp
is accessed using following methods:
func (r *Record) ObservedTimestamp() time.Time
func (r *Record) SetObservedTimestamp(t time.Time)
EventName
is accessed using following methods:
func (r *Record) EventName() string
func (r *Record) SetEventName(s string)
SeverityNumber
is accessed using following methods:
func (r *Record) Severity() Severity
func (r *Record) SetSeverity(s Severity)
Severity type is defined in severity.go.
The constants are are based on
Displaying Severity recommendation.
Additionally, Severity[Level] constants are defined to make the API more readable and user friendly.
SeverityText
is accessed using following methods:
func (r *Record) SeverityText() string
func (r *Record) SetSeverityText(s string)
Body
is accessed using following methods:
func (r *Record) Body() Value
func (r *Record) SetBody(v Value)
Log record attributes are accessed using following methods:
func (r *Record) WalkAttributes(f func(KeyValue) bool)
func (r *Record) AddAttributes(attrs ...KeyValue)
Record has a AttributesLen method that returns
the number of attributes to allow slice preallocation
when converting records to a different representation:
func (r *Record) AttributesLen() int
The records attributes design and implementation is based on
slog.Record.
It allows achieving high-performance access and manipulation of the attributes
while keeping the API user friendly.
It relieves the user from making his own improvements
for reducing the number of allocations when passing attributes.
The abstractions described in the specification are defined in keyvalue.go.
Value is representing any.
KeyValue is representing a key(string)-value(any) pair.
Kind is an enumeration used for specifying the underlying value type.
KindEmpty is used for an empty (zero) value.
KindBool is used for boolean value.
KindFloat64 is used for a double precision floating point (IEEE 754-1985) value.
KindInt64 is used for a signed integer value.
KindString is used for a string value.
KindBytes is used for a slice of bytes (in spec: A byte array).
KindSlice is used for a slice of values (in spec: an array (a list) of any values).
KindMap is used for a slice of key-value pairs (in spec: map<string, any>).
These types are defined in go.opentelemetry.io/otel/log package
as they are tightly coupled with the API and different from common attributes.
The internal implementation of Value is based on
slog.Value
and the API is mostly inspired by
attribute.Value.
The benchmarks1 show that the implementation is more performant than
attribute.Value.
The value accessors (func (v Value) As[Kind] methods) must not panic,
as it would violate the specification:
API methods MUST NOT throw unhandled exceptions when used incorrectly by end users. The API and SDK SHOULD provide safe defaults for missing or invalid arguments. [...] Whenever the library suppresses an error that would otherwise have been exposed to the user, the library SHOULD log the error using language-specific conventions.
Therefore, the value accessors should return a zero value and log an error when a bad accessor is called.
The Severity, Kind, Value, KeyValue may implement
the fmt.Stringer interface.
However, it is not needed for the first stable release
and the String methods can be added later.
The caller must not subsequently mutate the record passed to Emit.
This would allow the implementation to not clone the record,
but simply retain, modify or discard it.
The implementation may still choose to clone the record or copy its attributes
if it needs to retain or modify it,
e.g. in case of asynchronous processing to eliminate the possibility of data races,
because the user can technically reuse the record and add new attributes
after the call (even when the documentation says that the caller must not do it).
Implementation requirements:
-
The specification requires the method to be safe to be called concurrently.
-
The method must not interrupt the record processing if the context is canceled per "ignoring context cancellation" guideline.
-
The specification requires use the current time as observed timestamp if the passed is empty.
-
The method should handle the trace context passed via
ctxargument in order to meet the specification's SDK requirement to populate the trace context fields from the resolved context.
Emit can be extended by adding new exported fields to the Record struct.
Rejected alternatives:
- Record as interface
- Options as parameter to Logger.Emit
- Passing record as pointer to Logger.Emit
- Logger.WithAttributes
- Record attributes as slice
- Use any instead of defining Value
- Severity type encapsulating number and text
- Reuse attribute package
- Mix receiver types for Record
- Add XYZ method to Logger
- Rename KeyValue to Attr
Logger.Enabled
The Enabled method implements the Enabled operation.
Context associated with the LogRecord
is accepted as a context.Context method argument.
Calls to Enabled are supposed to be on the hot path and the list of arguments
can be extendend in future. Therefore, in order to reduce the number of heap
allocations and make it possible to handle new arguments, Enabled accepts
a EnabledParameters struct, defined in logger.go, as the second
method argument.
The EnabledParameters uses fields, instead of getters and setters, to allow
simpler usage which allows configuring the EnabledParameters in the same line
where Enabled is called.
noop package
The go.opentelemetry.io/otel/log/noop package provides
Logs API No-Op Implementation.
Trace context correlation
The bridge implementation should do its best to pass
the ctx containing the trace context from the caller
so it can later be passed via Logger.Emit.
It is not expected that users (caller or bridge implementation) reconstruct
a context.Context. Reconstructing a context.Context with
trace.ContextWithSpanContext
and trace.NewSpanContext
would usually involve more memory allocations.
The logging libraries which have recording methods that accepts context.Context,
such us slog,
logrus,
zerolog,
makes passing the trace context trivial.
However, some libraries do not accept a context.Context in their recording methods.
Structured logging libraries,
such as logr
and zap,
offer passing any type as a log attribute/field.
Therefore, their bridge implementations can define a "special" log attributes/field
that will be used to capture the trace context.
The prototype has bridge implementations that handle trace context correlation efficiently.
Benchmarking
The benchmarks take inspiration from slog,
because for the Go team it was also critical to create API that would be fast
and interoperable with existing logging packages.23
The benchmark results can be found in the prototype.
Rejected alternatives
Reuse slog
The API must not be coupled to slog,
nor any other logging library.
The API needs to evolve orthogonally to slog.
slog is not compliant with the Logs API.
and we cannot expect the Go team to make slog compliant with it.
The interoperability can be achieved using a log bridge.
You can read more about OpenTelemetry Logs design on opentelemetry.io.
Record as interface
Record is defined as a struct because of the following reasons.
Log record is a value object without any behavior. It is used as data input for Logger methods.
The log record resembles the instrument config structs like metric.Float64CounterConfig.
Using struct instead of interface improves the performance as e.g.
indirect calls are less optimized,
usage of interfaces tend to increase heap allocations.3
Options as parameter to Logger.Emit
One of the initial ideas was to have:
type Logger interface{
embedded.Logger
Emit(ctx context.Context, options ...RecordOption)
}
The main reason was that design would be similar to the Meter API for creating instruments.
However, passing Record directly, instead of using options,
is more performant as it reduces heap allocations.4
Another advantage of passing Record is that API would not have functions like NewRecord(options...),
which would be used by the SDK and not by the users.
Finally, the definition would be similar to slog.Handler.Handle
that was designed to provide optimization opportunities.2
Passing record as pointer to Logger.Emit
So far the benchmarks do not show differences that would favor passing the record via pointer (and vice versa).
Passing via value feels safer because of the following reasons.
The user would not be able to pass nil.
Therefore, it reduces the possibility to have a nil pointer dereference.
It should reduce the possibility of a heap allocation.
It follows the design of slog.Handler.
If follows one of Google's Go Style Decisions to prefer passing values.
Passing struct as parameter to LoggerProvider.Logger
Similarly to Logger.Emit, we could have something like:
type LoggerProvider interface{
embedded.LoggerProvider
Logger(name string, config LoggerConfig)
}
The drawback of this idea would be that this would be a different design from Trace and Metrics API.
The performance of acquiring a logger is not as critical as the performance of emitting a log record. While a single HTTP/RPC handler could write hundreds of logs, it should not create a new logger for each log entry. The bridge implementation should reuse loggers whenever possible.
Logger.WithAttributes
We could add WithAttributes to the Logger interface.
Then Record could be a simple struct with only exported fields.
The idea was that the SDK would implement the performance improvements
instead of doing it in the API.
This would allow having different optimization strategies.
During the analysis5, it occurred that the main problem of this proposal is that the variadic slice passed to an interface method is always heap allocated.
Moreover, the logger returned by WithAttribute was allocated on the heap.
Lastly, the proposal was not specification compliant.
Record attributes as slice
One of the proposals6 was to have Record as a simple struct:
type Record struct {
Timestamp time.Time
ObservedTimestamp time.Time
EventName string
Severity Severity
SeverityText string
Body Value
Attributes []KeyValue
}
The bridge implementations could use sync.Pool
for reducing the number of allocations when passing attributes.
The benchmarks results were better.
In such a design, most bridges would have a sync.Pool
to reduce the number of heap allocations.
However, the sync.Pool will not work correctly with API implementations
that would take ownership of the record
(e.g. implementations that do not copy records for asynchronous processing).
The current design, even in case of improper API implementation,
has lower chances of encountering a bug as most bridges would
create a record, pass it, and forget about it.
For reference, here is the reason why slog does not use sync.Pool3
as well:
We can use a sync pool for records though we decided not to. You can but it's a bad idea for us. Why? Because users have control of Records. Handler writers can get their hands on a record and we'd have to ask them to free it or try to free it magically at some some point. But either way, they could get themselves in trouble by freeing it twice or holding on to one after they free it. That's a use after free bug and that's why
zerologwas problematic for us.zerologas as part of its speed exposes a pool allocated value to users if you usezerologthe normal way, that you'll see in all the examples, you will never encounter a problem. But if you do something a little out of the ordinary you can get use after free bugs and we just didn't want to put that in the standard library.
Therefore, we decided to not follow the proposal as it is
less user friendly (users and bridges would use e.g. a sync.Pool to reduce
the number of heap allocation), less safe (more prone to use after free bugs
and race conditions), and the benchmark differences were not significant.
Use any instead of defining Value
Logs Data Model
defines Body to be any.
One could propose to define Body (and attribute values) as any
instead of a defining a new type (Value).
First of all, any type defined in the specification
is not the same as any (interface{}) in Go.
Moreover, using any as a field would decrease the performance.7
Notice it will be still possible to add following kind and factories in a backwards compatible way:
const KindMap Kind
func AnyValue(value any) KeyValue
func Any(key string, value any) KeyValue
However, currently, it would not be specification compliant.
Severity type encapsulating number and text
We could combine severity into a single field defining a type:
type Severity struct {
Number SeverityNumber
Text string
}
However, the Logs Data Model define it as independent fields. It should be more user friendly to have them separated. Especially when having getter and setter methods, setting one value when the other is already set would be unpleasant.
Reuse attribute package
It was tempting to reuse the existing [https://pkg.go.dev/go.opentelemetry.io/otel/attribute] package for defining log attributes and body.
However, this would be wrong because the log attribute definition is different from the common attribute definition.
Moreover, it there is nothing telling that the body definition has anything in common with a common attribute value.
Therefore, we define new types representing the abstract types defined in the Logs Data Model.
Mix receiver types for Record
Methods of slog.Record
have different receiver types.
In log/slog GitHub issue we can only find that the reason is:8
some receiver of Record struct is by value Passing Records by value means they incur no heap allocation. That improves performance overall, even though they are copied.
However, the benchmarks do not show any noticeable differences.9
The compiler is smart-enough to not make a heap allocation for any of these methods. The use of a pointer receiver does not cause any heap allocation. From Go FAQ:10
In the current compilers, if a variable has its address taken, that variable is a candidate for allocation on the heap. However, a basic escape analysis recognizes some cases when such variables will not live past the return from the function and can reside on the stack.
The Understanding Allocations: the Stack and the Heap presentation by Jacob Walker describes the escape analysis with details.
Moreover, also from Go FAQ:10
Also, if a local variable is very large, it might make more sense to store it on the heap rather than the stack.
Therefore, even if we use a value receiver and the value is very large it may be heap allocated.
Both Go Code Review Comments and Google's Go Style Decisions highly recommend making the methods for a type either all pointer methods or all value methods. Google's Go Style Decisions even goes further and says:
There is a lot of misinformation about whether passing a value or a pointer to a function can affect performance. The compiler can choose to pass pointers to values on the stack as well as copying values on the stack, but these considerations should not outweigh the readability and correctness of the code in most circumstances. When the performance does matter, it is important to profile both approaches with a realistic benchmark before deciding that one approach outperforms the other.
Because, the benchmarks9 do not proof any performance difference
and the general recommendation is to not mix receiver types,
we decided to use pointer receivers for all Record methods.
Add XYZ method to Logger
The Logger does not have methods like SetSeverity, etc.
as the Logs API needs to follow (be compliant with)
the specification
Rename KeyValue to Attr
There was a proposal to rename KeyValue to Attr (or Attribute).11
New developers may not intuitively know that log.KeyValue is an attribute in
the OpenTelemetry parlance.
During the discussion we agreed to keep the KeyValue name.
The type is used in multiple semantics:
- as a log attribute,
- as a map item,
- as a log record Body.
As for map item semantics, this type is a key-value pair, not an attribute.
Naming the type as Attr would convey semantical meaning
that would not be correct for a map.
We expect that most of the Logs API users will be OpenTelemetry contributors. We plan to implement bridges for the most popular logging libraries ourselves. Given we will all have the context needed to disambiguate these overlapping names, developers' confusion should not be an issue.
For bridges not developed by us, developers will likely look at our existing bridges for inspiration. Our correct use of these types will be a reference to them.
At last, we provide ValueFromAttribute and KeyValueFromAttribute
to offer reuse of attribute.Value and attribute.KeyValue.