Skip to content

Conversation

@anonymoususer72041
Copy link

Summary

This change adds a default robots.txt that blocks crawling of the entire OpenCATS installation by default while explicitly allowing the public careers portal and the static assets it needs.

The file whitelists the /careers/ path as the preferred public entry point for job listings and job detail/apply pages, and allows common asset paths such as /js/, /images/, /main.css, /careersPage.css, /ie.css and /not-ie.css so that search engines can properly render the public careers pages.

For installations that expose public careers pages via query-based URLs, the robots.txt includes commented Allow rules for /index.php?m=careers variants together with an explanatory comment.

Motivation

OpenCATS is primarily an internal ATS, so most of the application (including candidate data, client data, attachments, and admin interfaces) should not be discoverable or indexed by search engines. At the same time, many deployments rely on the public careers portal to advertise open positions and want those job pages to remain indexable.

Adding a conservative robots.txt with a "block everything by default, explicitly allow the public careers portal and its assets" approach helps prevent accidental indexing of internal areas while still supporting SEO for job postings.

@anonymoususer72041 anonymoususer72041 force-pushed the security/add-default-robots-txt branch from fc4ef71 to ae4e93c Compare December 7, 2025 15:26
anonymoususer72041 added a commit to anonymoususer72041/OpenCATS that referenced this pull request Dec 11, 2025
commit ae4e93c
Author: anonymoususer72041 <247563575+anonymoususer72041@users.noreply.github.com>
Date:   Sun Dec 7 15:36:04 2025 +0100

    Add default robots.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant