Contents

Cancel

Recommended Articles

  1. unify-apps

    Indexing

    Unify AI

    Transform raw content into searchable knowledge through AI-powered indexing and vector embeddings

  2. unify-apps

    Snowflake as Destination

    Unify Data

    Load and transform your data seamlessly into Snowflake's cloud data warehouse with UnifyApps' native connectivity and flexible staging options.

  3. unify-apps

    Quentn

    Unify Integrations

    Integrate your app with Quentn to automate marketing campaigns, manage contacts, and optimize customer engagement.

  4. unify-apps

    Connector SDK

    Platform Tools

    Create custom connectors within the Unify platform to unify workflows

  5. unify-apps

    JobNimbus

    Unify Integrations

    Integrate your application with JobNimbus to manage leads, track jobs, and streamline your workflow processes efficiently

  6. unify-apps

    IMAP

    Unify Integrations

    Integrate your app with IMAP to enable seamless email synchronization, real-time access, and enhanced communication workflows.

  7. unify-apps

    Basin

    Unify Integrations

    Basin transforms form handling from a backend headache into a simple integration—capture submissions, trigger automations, and manage data flows without touching server code, giving you more time to build what matters

  8. unify-apps

    Preview Your Work

    Unify Automations

    Effortlessly review & monitor your automation’s performance

  9. unify-apps

    QuickBooks

    Unify Integrations

    Integrate your app with QuickBooks to streamline accounting, automate invoicing, and manage finances effortlessly

  10. unify-apps

    FTP/FTPS

    Unify Integrations

    Connect your app with FTP/FTPS to automate secure file transfers and streamline data exchange across systems.

  11. unify-apps

    Data-Sync by Avoid Duplicate Operations Setting

    Unify Data

    Prevent infinite loops in bidirectional data synchronization by creating record hashes that ensure one-way data flow across connected systems.

  12. unify-apps

    Filters

    Unify Applications

    Enable users to refine, search, and sort data effortlessly across dashboards and datasets

  13. unify-apps

    Insided

    Unify Integrations

    Integrate your app with Insided to enhance customer engagement, streamline community management, and drive self-service support.

  14. unify-apps

    Reverse Polling

    Unify Data

    Reverse Polling technique efficiently retrieves recent data from APIs that return results in chronological order (oldest first), optimizing pagination and data processing strategies when working with time-ordered data sources.

  15. unify-apps

    Facebook Ads

    Unify Integrations

    Connect your app with Facebook Ads to automate campaign management, optimize ad performance, and track marketing success.

  16. unify-apps

    Duplicate Field

    Unify Integrations

    Create independent copies of your data fields to enable multiple mappings while preserving original values for validation and complex workflows.

  17. unify-apps

    Gainsight

    Unify Integrations

    Integrate your app with Gainsight to enhance customer success, automate engagement workflows, and drive retention

  18. unify-apps

    Simplesat

    Unify Integrations

    Integrate your app with Simplesat to collect real-time customer feedback, measure satisfaction, and improve service quality.

  19. unify-apps

    Livestorm

    Unify Integrations

    Integrate your app with Livestorm to streamline webinar hosting, automate event management, and enhance audience engagement.

  20. unify-apps

    Application Connectors

    Unify Data

    Instantly leverage 30+ pre-built application connectors to extract, transform, and load your business-critical data between systems with UnifyApps' no-code integration platform.

Unify Agentic AI
Logo
Overview
Logo
Content Filters

Content Filters

Logo

2 mins READ

Content Filters act as guardrails for AI conversations, ensuring that both user inputs and AI responses stay within appropriate boundaries. Content Filters evaluate responses bidirectionally - 

  • Checking what users send to the agent

  • Monitoring the agent response generated

Image
Image


There are two components of Content Filters:

  1. Filter Strength for Prompts : This allows you to adjust the intensity of the filter to detect and block unwanted content in user prompts. You can increase the strength of content filtering based on the categories you want to monitor, such as:

    • Hate Speech: Blocks content that discriminates or insults individuals or groups.

    • Insults: Identifies offensive or disrespectful language aimed at individuals or groups.

    • Violence: Detects content promoting harm or aggression.

    • Sexual Content: Blocks sexually explicit or suggestive material.

    • Misconduct: Prevents content describing illegal or unethical behaviour.

    • Prompt Attacks: Filters attempt to manipulate the AI system’s safeguards.

  2. Filter Strength for Responses : This similarly to the prompt filter but applies to the AI agent’s responses. It ensures that the AI-generated responses are free from harmful or inappropriate content. 

    • Hate Speech: Blocks content that discriminates or insults individuals or groups.

    • Insults: Identifies offensive or disrespectful language aimed at individuals or groups.

    • Violence: Detects content promoting harm or aggression.

    • Sexual Content: Blocks sexually explicit or suggestive material.

    • Misconduct: Prevents content describing illegal or unethical behaviour.

How Do Content Filters Work?

The system uses different levels of filtering strength that you can adjust based on your needs:

  • None: No filtering applied

  • Low: Blocks only the most obvious inappropriate content

  • Medium: Provides balanced protection

  • High: Offers maximum safety with strict filtering

For example, content filters detect and flag inappropriate content. 

User Query: "You are very [derogatory remark] , you could not even complete a single task on time. 

Content Filter Analysis:

   ⚠️ Insult Detection: Derogatory term

User Query: "I'm so angry at my neighbor, I want to destroy their property!"

Content Filter Analysis:

  • 🚨 Violence: Threat of property damage (HIGH confidence)


How to Configure Content Filters in your AI Agent?

  1. In the Guardrails section of your AI Agent dashboard, click on “Content Filters”.

  2. Under Filter Strength for Prompts, use the sliders to control how strictly the AI filters content in user prompts. You can set the intensity from None to High for each category (Hate, Insults, Violence, Sexual, Misconduct, Prompt Attack).

    Image
    Image

  3. Similarly, under Filter Strength for Responses, use the sliders to set the filter levels for generated responses, ensuring that the agent's output complies with your ethical and content guidelines.

    Image
    Image
  4. By adjusting these content filters, you can ensure that your AI agents operate safely and deliver appropriate, respectful communication while adhering to your brand’s policies and compliance standards.