heretic

We have hosted the application heretic in order to run this application in our online workstations with Wine or directly.

Run heretic online

Quick description about heretic:

Heretic is an open-source Python tool that automatically removes the built-in censorship or �safety alignment� from transformer-based language models so they respond to a broader range of prompts with fewer refusals. It works by applying directional ablation techniques and a parameter optimization strategy to adjust internal model behaviors without expensive post-training or altering the core capabilities. Designed for researchers and advanced users, Heretic makes it possible to study and experiment with uncensored model responses in a reproducible, automated way. The project can decensor many popular dense and some mixture-of-experts (MoE) models, supporting workflows that would otherwise require manual tuning. Beyond simple decensoring, Heretic includes research-oriented options for analyzing model internals and interpretability data.

Features:

Fully automatic censorship removal
Support for transformer-based LLMs
TPE-based parameter optimization
Command-line usability
Research tools for model analysis
CLI + Python interface

Programming Language: Python.
Categories:

Large Language Models (LLM)

Page navigation:

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.