How does DomDistiller identify the main content on a webpage?

DomDistiller identifies the main content through a heuristic scoring system. It assigns positive scores to elements with high text density, semantic HTML tags like <article> and <p>, and negative scores to elements with high link density or tags like <nav>, <footer>, and classes/IDs containing terms like 'comment', 'ad', or 'sidebar'.