Data Types in Lemnisk
Lemnisk captures all the User Behaviour and User Attributes information via various parameters, be it in App/Web Events, REST API or Offline Files. Listed below are the data types that Lemnisk supports while ingesting data.
Lemnisk will govern the data sanity when it receives real-time events from your App, Wesbite or REST API. Any deviations to the defined data type (defined in Events Dictionary or Customer Properties) of the incoming event, will result in failing of the event, which will be captured and reported in Event telemetry.
STRING
Equals, Not equals, Contains, Does not contain, Starts with, Ends with, Is Present
Is Present → True/False
True: The data point exists and is not empty/null/undefined
False: The data doesn't exist or it's empty/null/undefined
A sequence of characters, such as letters, numbers, symbols, and spaces typically used to represent text-based information.
NUMBER
Equals, Not equals, Contains, Does not contain, Starts with, Ends with, Greater than, Less than, Is Present
Is Present → True/False
This represents numerical values, which can be either integers (whole numbers) or floating-point numbers (numbers with decimal points).
Is Present (checks for value)
Is Present → True/False
True: The data point exists and is not empty/null/undefined
False: The data doesn't exist or it's empty/null/undefined
An Email data type represents an electronic mail address. It is a specialised form of a string that adheres to a specific format.
If Email is defined as the data type, the CDP will hash and store the respective values as a security measure to store PII data.
As the data is hashed and stored as STRING, it can be accessible for targeting.
PHONE
Is Present (checks for value)
Is Present → True/False
True: The data point exists and is not empty/null/undefined
False: The data doesn't exist or it's empty/null/undefined
This data type is specifically designed to store telephone numbers which often accommodates various international formats and may include country codes or extensions.
If Phone or Email are chosen, the CDP will hash and store the respective values, making it accessible for targeting.
ENTITY
Properties within the Entity are further defined as one of the above 4 data types, and segmentation rules apply to the Entity properties, not the parent Entity as a whole.
Entity data type is used to store multiple products/purchases information against a single user, where numerous details of every purchase is to be stored, and communicated for. For example Policies, Leads, Flight Bookings etc.
Key Considerations when defining Data Types
When defining data types for customer properties and event properties within the Lemnisk CDP, it's crucial to understand the implications of your choices.
1. Validation and Consistency
Source Validation: Properties defined in the identify event and otherIds section of a track event are rigorously validated against the Customer Properties section. This means any incoming data must conform to the predefined data type.
Offline File Validation: All properties imported via offline files are also subject to the same data type validation and will be displayed in Customer Properties.
Strict Type Checking: If an incoming customer property has an invalid data type (e.g., a
number
sent for aString
field), it will be reported as a failure in the Event Telemetry section. This ensures data integrity and governance of the model defined in the CDP.
2. Impact on Segmentation and Macro Personalisation
Segmentation: Data types directly influence how properties can be used for segmentation. For instance,
Number
types support numerical operators (e.g., greater than, less than), whileString
types allow for text-based comparisons.Data Type Changes: Modifying the data type of an existing Customer Property or Event property is not allowed. This is because the earlier history of data ingested cannot be modified, and hence, future segmentation would be infeasible. If you wish to change the data type of an existing property, simply create a new property and start using the same.
3. Special Data Types for User Identifiers and PII
Phone and Email Hashing: When you select Phone or Email as a data type for a customer property, the CDP will hash and store these values. This process makes the data targetable while potentially enhancing privacy by not storing the raw values in an unhashed format which is directly accessible.
Editing Restrictions: For User Identifiers and PII customer properties, users can only edit the Display Name. This restriction helps maintain the integrity of crucial identification fields.
4. Creating Customer Properties
When creating new customer properties, these fields are essential:
Name (Mandatory): This is the unique, actual name used for validation and internal referencing across identify events and offline files.
Display Name (Optional): This user-friendly name appears in macros and segmentation. If left empty, the
Name
will be used.Data Type (Mandatory): Currently, for newly created customer properties, the supported data types are String, Number, or Date.
Last updated