Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

p at e n t s

Patent landscaping for life sciences innovation:


toward consistent and transparent practices
Tania Bubela, E Richard Gold, Gregory D Graff, Daniel R Cahoy, Dianne Nicol & David Castle
As industry, governments and academia increasingly rely on patent landscapes to map scientific and
technological trends, an interdisciplinary workshop provides recommendations for developing consistent and
transparent landscaping practices.

M ore than ever before, industry, govern- persist in landscaping techniques3. These cre- influence of intellectual property (IP) rights
© 2013 Nature America, Inc. All rights reserved.

ments and academia rely on ‘landscapes’ ate obstacles in assessing the informational on innovation. Although freedom-to-operate
to map scientific and technological trends value of individual landscapes and in com- opinions fall within the formal definition of a
within specific fields of science and technol- paring, combining and extending multiple landscape, they follow a set of norms unique
ogy. A landscape is an analysis of the relation- landscapes. A second challenge arises from to the practice of law and focus on the analy-
ships between multiple sets of indicators or of the lack of transparency about the data and sis of potential liabilities. Also closely related
those indicators measured against temporal, techniques used. In response, three of the are corporate, regional or national assess-
technical or spatial dimensions. Indicators authors (Bubela, Castle, Gold) organized a ments of R&D activities; however, to the
might include scientific articles, patents, clin- workshop with representatives from indus- extent that they cut across multiple fields of
ical or field trials, regulatory approvals, and try, academia, public funding agencies, pat- technology found within the individual firm
actors or institutions. Additional analyses can ent offices and other government agencies. or geographic region, while excluding any
represent network connections or the density Workshop participants discussed landscap- context beyond its boundaries, they fail to fit
of clusters of scientific or technological fields. ing practices and challenges for the repre- the definition of a landscape. We therefore do
As countries within the Organization for sentative field of synthetic biology. Based not discuss them further.
Economic Co-operation and Development on workshop presentations and discussion,
strive to integrate science policy and innova- the authors developed recommendations to Tailoring analytical strategies
tion strategy1, they place greater reliance on improve landscaping methodologies and to The most effective patent landscaping pro-
landscapes to track trends and support the make their use more consistent and trans- tocols align scope and methodology with
coordination of activities, actors and institu- parent. Workshop participants reviewed the the purpose of the target audience and the
npg

tions2. recommendations. This article focuses on specific issues to be addressed (Table 1).
Despite the growing prevalence and patents as the most commonly used indica- Various audiences, particularly policy makers
importance of landscapes, including some tor in landscapes4. and academic consumers, share an interest in
published in high-impact scientific and Patent landscapes vary in scale and scope, some of the same issues; other communitites
policy journals, serious inconsistencies ranging from specific in-depth analyses have narrowly defined, unique interests.
of a narrow range of patents to large-scale Similarly, practitioners of patent landscap-
landscaping of entire technological fields. ing vary from dedicated corporate actors
Tania Bubela is at the School of Public Health, Landscapes of any scale seek to encompass, conducting analyses for internal or contract
University of Alberta, Edmonton, Alberta, as much as possible, an entire population of purposes, to researchers interested in aca-
Canada; E. Richard Gold is in the Faculty of relevant data, rather than a random sample demic or policy questions, to government
Law, McGill University, Montreal, Quebec, drawn from that population. In science and agencies, such as policy branches of national
Canada; Gregory D. Graff is in the Department technology, as well as in legal disputes, single IP offices and international organizations.
of Agricultural and Resource Economics, seminal events can be crucial to understand- For industry, patent landscapes can enable
Colorado State University, Fort Collins, ing an entire field, but can easily be missed in a high-level understanding of a sector for
Colorado, USA; Daniel R. Cahoy is at Smeal a random sample. The resulting data can be strategic planning, especially across juris-
College of Business, The Pennsylvania State visualized graphically or can comprise counts dictions5. An analysis of other actors in the
University, University Park, Pennsylvania, USA; of indicators across selected dimensions. same technology space can assist a company
Dianne Nicol is in the Faculty of Law, University The data capture portions of the applied and in evaluating its initial idea in relation to
of Tasmania, Hobart, Tasmania, Australia; and translational research environment, identify- potential competitors, distribution channels
David Castle is at Innogen Institute, University ing areas of research and development (R&D) and partners. An analysis of the way claims
of Edinburgh, Edinburgh, UK. considered to be of commercial value and are drafted (e.g., narrowly or broadly) for
e-mail: tbubela@ualberta.ca providing an understanding of the potential similar technologies in different jurisdictions

202 volume 31 number 3 MARCH 2013 nature biotechnology


pat e n t s

or areas of growth. Conversely, investment


Table 1 Landscaping issues and analysis can be spurred in underfunded or emer-
Issues that can be addressed gent fields in response to a growth in R&D
through landscaping Type of landscape/analysis Common metrics in competitor markets. Tracking inven-
What aspects of particular Technology landscape Classification/claim type/ tor affiliations and applicant firm histo-
technologies, products or fields Comparative technology landscape technology keywords X
do patents cover? • Issued patents
ries can enable policy makers to analyze
Prior art search the flow of human resources within and
• Patent applications
Licensing opportunity analysis between jurisdictions. For example, China
• Country of origin
• Assignee/inventor is attracting researchers and promoting
How do patent rights affect Institutional portfolio analysis Assignee(s)/inventor(s) X studies abroad for its citizens to advance its
certain firms or institutions? Inventor portfolio analysis • Issued patents high-technology research agenda in fields
Performance review • Patent applications such as nanotech9.
Competitor analysis • Country of origin Further, assessing the level of innovation in
Industry analysis • Classification/claim type specific fields can assist regulators and tech-
How do patent rights map to Regional innovation indicators Geography/region X nology assessment agencies to proactively
geographic regions or Innovation cluster analysis • Classification/claim type develop policies and procedures for emerging
countries? Foreign filing analysis technologies. The forecasts enabled by land-
• Issued patents
International patent family landscape • Patent applications scapes are invaluable because it takes time
Country/region of inventor and move- • Country of origin to adapt existing regulatory frameworks for
ment over time
• Assignee/inventor health, food and drugs, environmental and
Which patents are the most Claim construction • Number/scope of claims laboratory practices to new technologies.
© 2013 Nature America, Inc. All rights reserved.

important or valuable? Bibliometric analyses • Forward citation In addition to defining the broad con-
Litigation analysis • Patent families tours of a technology, patent landscaping
• Litigation can be used to address specific questions
• Maintenance fees of policy and academic relevance. These
How do patents relate to one Bibliometric analyses • Forward/backward citations include assessing the impact of permissive or
another? Network citation analyses • Keywords restrictive research and IP policies in specific
Semantic similarity analysis • Co-inventorship technology sectors and identifying emerging
• Assignee/inventor links patent thickets, blocking patents or potential
• Network statistics antitrust issues. With this knowledge, policy
How might patents affect Patent counts • Scope of claims makers will be in a better position to inter-
innovation or competition Patent claims analysis • Issued patents
policy?
vene—through the exercise of government
Patent density • Classification/claim type use rights, waiver of sovereign immunity,
Statistical modeling • Patent applications compulsory licensing or the development
• Country of origin of guidelines for the issuance of patents—or
• Assignee/inventor refrain from intervening, as appropriate.

can assist in developing patent drafting strate- and policy makers. For example, the JPO Patent landscaping methodologies
gies. Referred to as claims construction, such has been tracking the rise of patenting by When engaging in patent landscaping, prac-
npg

analysis can also assist in identifying prior art Chinese applicants in Japan, especially in titioners generally begin by defining the
that ought to be referenced or distinguished the field of nanotech. The European Patent purpose of the proposed landscape and its
to meet patent criteria of novelty and inven- Office (EPO) engaged in a joint project with potential scope combined with a realistic
tiveness. Monitoring the field is essential for the United Nations Environment Program assessment of time and cost constraints4.
longer-term asset management and the devel- and the International Centre for Trade and Data collection and processing phases are
opment of market strategies as well as firm val- Sustainable Development (ICTSD) to land- especially time consuming, and the need to
uation and portfolio analysis for investment, scape patents on clean energy technologies8. access proprietary databases can increase
and mergers and acquisitions6. Patent offices are uniquely positioned to pro- costs. Maximizing data capture at the first
Some IP offices engage in larger-scale vide landscapes to guide national industrial stage increases the likelihood that retrieved
patent landscaping to assist national or policy and firm strategies. They have access data can be reanalyzed if the focus changes
regional innovative firms and government to internal databases—including correspon- as different analyses based on different sets
agencies that set science and technology dence between examiners and applicants— of metrics can be applied to collected data
policies or funding strategies. For example, and to highly qualified technical staff with sets, depending on the issue to be addressed.
the United Kingdom Intellectual Property not only scientific knowledge, but also an The most common strategy is to define a field
Office (UKIPO) has a policy branch that understanding of how to interpret patent of interest and develop a search strategy that
produces landscapes and analyses of specific claims, of emerging terminology and of the captures as complete a set of patent docu-
technology domains of strategic interest for patenting process itself. ments as possible. An alternative strategy is to
economic development in the UK, such as Broader analyses comparing technology define foundational or enabling technology
stem cell research7. Similarly, the policy unit fields over time and between jurisdictions patents and then build a larger set of follow-on
of the Japanese Patent Office (JPO) tracks can inform national investments in R&D. patents through linkages between patent doc-
technology of national interest to provide Funding can be directed toward identi- uments, such as by following citations. This
advice and information to Japanese industry fied regional strengths, innovative sectors second strategy can be useful for emerging

nature biotechnology volume 31 number 3 MARCH 2013 203


pat e n t s

but ill-defined fields such as nanotech and an exception to the latter, enabling search- new Cooperative Patent Classification has
synthetic biology. Figure 1 illustrates the ing in multiple languages and tools specific recently been made available as a joint ini-
iterative process of patent landscaping. for searching protein and DNA sequences tiative to speed the patent granting process
worldwide (http://www.patentlens.net). The between the US Patent and Trademark Office
Search. Once the broad issues and the over- World Intellectual Property Organization’s and the EPO, and has replaced the US and
all strategy have been determined, the next (WIPO) Patent Scope is another excep- European classes as of January 1, 2013.
step is to design the specific search strat- tion, as it provides basic visual represen- Other limits on searches can be orga-
egy. This necessitates an identification of a tations of patent application data such as nized by geography, assignees, inventors
suitable patent database from among both inventor or country of origin (http://patent or other definitional fields. One example
publicly available and proprietary options, scope.wipo.int/search/en/search.jsf ). Private is the Ade/Cook-Deegan algorithm used
many of which are listed in a resource pro- databases, most notably Thomson Innovation by the DNA Patent Database housed at
vided by the Patent Information Users Group and Elsevier’s Scopus Database, enable com- Georgetown University in Washington, DC,
(http://piug.wildapricot.org/vendors), fol- plex search algorithms (Boolean and/or to identify all DNA and RNA patents in
lowed by the development of database- natural language) across multiple jurisdic- the Delphion database (http://dnapatents.
specific search algorithms combined with tions, provide hand-curated information in georgetown.edu/SearchAlgorithm-
limits to the scope of the search (for example, addition to standard information based on Delphion-20030512.htm). Another is the
date, legal status or patent class/code)10. At public data, and higher-order analytics, such algorithm developed by Bergman and Graff
this stage, it is advantageous to consult with as corporate histories and sophisticated visu- to capture all stem cell–related patents in
a specialist in information science who is spe- alization tools. the Thomson Innovation database12. Such
cifically trained in the development of search The database-specific search algorithm algorithms are developed in consultation
algorithms. can combine sets of keywords comprising with technical experts. Expert validation
© 2013 Nature America, Inc. All rights reserved.

Although free, public databases have lim- synonyms for specific technologies or broad ensures that important branches or key tech-
ited capabilities for complex search algo- fields. Keywords are often combined with nologies within a field are not omitted, and
rithms, provide only basic informational patent class codes that group patents into conversely, refines the algorithm to reduce
fields within specific countries or regions, technological categories. Patent class codes the percentage of irrelevant documents. The
and generally provide limited higher- include the US Class (USC), International steps of algorithm development, and the next
order analytic tools11. PatentLens, from Patent Class (IPC), European Patent Class steps of data collection, data cleaning, data
the Initiative for Open Innovation (IOI), is (EPC) and Derwent Manual Codes. The curation and exploratory analyses are neces-
sarily iterative and reliant on expert input to
arrive at a final data set capable of address-
ing strategic, policy or academic questions.
Assess time & The breadth or specificity of such questions
cost constraints is dependent on scale and quality.

Cleaning and curating. The next step,


Define purpose and scope of landscape after the search for and collection of pat-
ent documents, is to clean and curate data,
a time-­consuming exercise, especially if data
Consult with have been collected from multiple public
npg

Design search strategy


information & databases. Cleaning involves reviewing the
• Identify database(s)
technical • Develop search algorithm(s) documents in the data and discarding any
specialists that are outside of the intended scope of the
analysis. Cleaning can be automated using
Data cleaning & curation criteria built into the search function, or
• Merge data sets technical experts can do it manually if the
• Define degree of error tolerated data set is not impracticably large. Data cura-
• Remove irrelevant documents
tion includes the merging of data from dif-
ferent databases into consistently formatted
and structured data sets, possibly followed
Augment data set with by manual or automated mechanisms for
Expert periodic future data updates. International
additional fields or
validation
manual coding analysis of patent data necessarily introduces
the complication of single inventions being
represented as ‘families’ of filings at multiple
patent offices. Curation of a landscape involv-
ing more than one jurisdiction must manage
Higher-order analytics the data at the level of patent family and the
Descriptive Visualization • Importance/value
statistics of trends • Relationships between documents level of individual patent documents8.
• Impact of document on innovation The degree of error that can be toler-
ated, both in terms of the relevance of the
Figure 1 The iterative process of patent landscaping. document set and the structure of individual

204 volume 31 number 3 MARCH 2013 nature biotechnology


pat e n t s

fields, depends on the scale of the analysis, Qualitative characteristics of patents, such next-generation genetic technologies. This
partly based on considerations of time and as their scientific significance or private eco- is based on a novel automated landscaping
cost4. Nevertheless, landscapers must assess nomic value, can be determined by exam- method developed by Jensen and Murray,
the nature (for example, a tendency toward ining the number of claims, the number of which mapped sequences named in pat-
over- or under-inclusion) and magnitude of jurisdictions covered by the patent family and ents (as SEQ IDs) to the human genome22.
errors, possibly through a detailed analysis of the number of times the patent is cited14,15. Further examination of a subset of the pat-
a subset of the data set. For broad landscapes Studies have shown that the number of cita- ents by the most accurate method of claim
directed toward identifying, as completely as tions made to a patent is related to the private construction—reading by experts—found
possible, an entire population of documents, economic value of that patent16,17. The tech- that far fewer of these alleged human gene
a metric for assessing data saturation should nical diversity of citing patents, as indicated patents actually recite human DNA mole-
be defined and reported. The error types and by the range of technology classes they cover, cules, methods for genetic testing or genetic
rates, as well as underlying assumptions, must can be related to the scientific “generality” sequencing in the claims23,24. Although this
be reported so users may assess the reliability or “basicness” of the subject patent18. The does not undermine the value of the meth-
and validity of a given landscape. degree to which a patent has been litigated is odology developed by Jensen and Murray,
also correlated with patent value, and patents it points to the misunderstanding and pos-
Analysis. Analysis generally commences with can be linked to litigation databases19. sible misuse by others of landscaping stud-
standard exploratory or summary statistics The final, augmented data set can then ies, particularly where methods are unclear
per data field and simple analyses such as be analyzed using sophisticated visualiza- or misinterpreted.
how these fields vary temporally or by patent tion and graphing techniques or statistical Landscaping practitioners must be careful
jurisdiction. Landscapes, by definition, are modeling techniques, depending on the to select methodologies capable of answering
multidimensional, analyzing document attri- issues of interest20. There are many avail- questions of interest at the appropriate scale.
© 2013 Nature America, Inc. All rights reserved.

butes across combinations of time, geogra- able visualization and analytic tools with An iterative process with relevant expert
phy and technical field. Document attributes varying utility11. Geographic Information consultation is essential. The limitations of
equate to data fields such as inventors, assign- System (GIS) software can represent nodes the databases, search strategies and analyti-
ees or applicants, legal status, or technology (for example, individuals or cities) and the cal methods must be made explicit. Reliance
classification and/or code. Such simple ana- strength of the linkages between them on a on proprietary landscapes can accordingly
lytics, although nominally informative, also geospatial map. Software that visualizes col- be problematic because search algorithms
provide validation of the prior steps and can laborative networks of inventors or applicants and landscaping methods are rarely pub-
point to the need for further refinements can also display node attributes, the strength lished. Transparency would be improved if,
in the data. The data set can be augmented of the linkages between them and calcula- when proprietary databases are used, pat-
with additional data by, for example, cat- tions of network statistics that indicate how ent numbers are included in an appendix or
egorizing patent assignees or inventors as central or important an actor is within the Supplementary Information. This is simply
public or private sector actors; coding patent network. Thematic maps cluster documents good practice for all scientific and social sci-
claims for subject matter, scope or validity; or according to similarity of keywords, showing ence research; methods should be readily rep-
doing automated semantic analyses of key- peaks and valleys of activity within specific licable to assess the validity of conclusions.
words descriptive of specific fields or types fields. Lexicographic analyses further allow Journals can act as gatekeepers to ensure
of claims, such as distinguishing between the tracking of text between documents so methodological best practices for social sci-
products and processes. that keywords, phrases or concepts can be ence research.
For questions regarding the technology actu- traced through different forms of documents To improve landscaping activities, gov-
npg

ally protected, manual coding of the claims of originating with different sets of actors, for ernments, research institutions and private
the patent documents within the landscape is example, linking scientific publications with foundations should support the development
often essential for meaningful conclusions; for patents. Finally, bibliometric analyses of cita- of publicly accessible databases and associ-
example, coding whether patents for genetic tion patterns (for example, backward and ated analytical tools3. This holds true for
diagnostic tests are blocking or can be circum- forward citations and co-citations to other patent databases and for publications data-
vented by public laboratories13. Automated patents or scientific literature) can help to bases more generally. The best overview of
lexicographic analyses of claims provide only analyze the intellectual structure of a disci- an innovation landscape can be accomplished
limited inferences about the nature of the pline to track citations through generations of by situating the patent landscape within other
research activity or the scope of control because publications in a field, or to identify and track information of R&D and corporate activi-
they ignore the subtleties imposed by legisla- seminal or foundational documents20,21. ties, for example, by cross-referencing to
tive texts and court decisions. Experts such as publications and mergers-and-acquisitions
patent attorneys, agents and examiners are well Recommendations for practitioners databases.
versed in these subtleties and are best placed to Landscapes can be powerful tools for devel- Patent and other landscaping techniques
conduct such analysis. oping policy and business strategy. Most, are critical factors in developing science
Large data sets cannot realistically be however, are relatively simple, relying on and technology policy and business strat-
coded by hand. Here, analysts must rely on count data and possibly thematic maps4. egy. As with many other emerging tech-
coding frames to analyze claims language. In Many are opaque in the description of niques, early adopters have experimented
these cases, it is imperative that the methods, their methods, risking interpretations that with an assortment of methods to provide
the background of the coders, their coding go beyond the inherent limitations of the valuable insight into technology spaces.
training, measures of intercoder reliability, landscape in question. One example is the Landscaping techniques are now sufficiently
and the frame itself are adequately explained much-quoted statistic that 20% of human mature and used widely enough in support of
and contextualized. genes are patented and likely to impede science and innovation policies that general

nature biotechnology volume 31 number 3 MARCH 2013 205


pat e n t s

methodologies, reporting criteria and consis- COMPETING FINANCIAL INTERESTS 10. Bonino, D., Ciarmella, A. & Corno, F. World Pat. Inf.
tent practices ought to be established. These The authors declare no competing financial interests. 32, 30–38 (2010).
11. Yang, Y.Y., Akers, L., Klose, T. & Yang, C.B. World
will enhance the reliability of landscapes Pat. Inf. 30, 280–293 (2008).
upon which business, government and aca- 1. OECD. The OECD Innovation Strategy: Getting a Head 12. Bergman, K. & Graff, G.D. Nat. Biotechnol. 25,
Start on Tomorrow (OECD, Paris, 2010). 419–424 (2007).
demic leaders can base science and technol- 2. OECD. Collaborative Mechanisms for Intellectual 13. Huys, I., Berthels, N., Matthijs, G. & Van Overwalle,
ogy decisions. Property Management (OECD, Paris, 2011). G. Nat. Biotechnol. 27, 903–909 (2009).
3. Gold, E.R. & Baker, A.J. L. Inf. Sci. 22, 76–97 14. OECD. OECD Patent Statistics Manual (OECD, Paris,
(2012). 2009).
ACKNOWLEDGMENTS 4. Managing Knowledge in Synthetic Biology: The 15. Lanjouw, J.O. & Schankerman, M. Econ. J. 114,
Creation of Tools for Stronger Intellectual Property 441–465 (2004).
We would like to thank the participants from industry,
Analysis, Edinburgh, UK, June 20–21, 2012. 16. Trajtenberg, M. Rand J. Econ. 21, 172–187 (1990).
government agencies and academia at the Managing 5. Lee, S., Yoon, B., Lee, C. & Park, J. Technol. 17. Hall, B.H., Jaffe, A. & Trajtenberg, M. Rand J. Econ.
Knowledge in Synthetic Biology: The Creation of Tools Forecast. Soc. Change 76, 769–786 (2009). 36, 16–38 (2005).
for Stronger Intellectual Property Analysis workshop 6. Breitzman, A.F. & Mogee, M.E. J. Inf. Sci. 28, 187– 18. Hall, B.H., Jaffe, A.B. & Trajtenberg, M. NBER
hosted by the Innogen Institute of the University of 205 (2002). Working Paper 8498 (2001).
Edinburgh in 2012. The workshop was supported 7. United Kingdom Intellectual Property Office. 19. Allison, J.R., Lemley, M.A. & Walker, J. Georgetown
by VALGEN (Value Addition through Genomics) Regenerative Medicine: The Patent Landscape in Law J. 99, 677–712 (2011).
and GE3LS (Genomics and its Related Ethical, 2011 (Intellectual Property Office, Newport, 2011). 20. Small, H., Sweeney, E. & Greenlee, E. Scientometrica
<http://www.ipo.gov.uk/informatic-regenmed.pdf> 8, 321–340 (1985).
Environmental, Economic, Legal and Social Aspects),
8. United Nations Environment Program. Final report: 21. Bubela, T., Strotmann, A., Noble, R. & Morrison, S.
a project sponsored by the Government of Canada patents and clean energy: bridging the gap between Cell Stem Cell 7, 25–30 (2010).
through Genome Canada, Genome Prairie and Genome evidence and policy, (UNEP, EPO, ICTSD, Munich, 22. Jensen, K. & Murray, F. Science 310, 239–240
Quebec, and the PhytoMetaSyn project funded by 2010). <http://www.epo.org/news-issues/issues/ (2005).
Genome Canada, Genome Alberta and Genome clean-energy/study.html> 23. Holman, C.M. UMKC Law Rev. 80, 563–605 (2012).
Quebec. We thank A. Baker, D. Lewensohn, M. Bieber 9. Lenoir, T. & Herron, P. J. Biomed. Discov. Collab. 4, 24. Holman, C.M. Nat. Biotechnol. 30, 240–244
and L. Dacks for administrative and research support. 8 (2009). (2012).
© 2013 Nature America, Inc. All rights reserved.
npg

206 volume 31 number 3 MARCH 2013 nature biotechnology

You might also like