Writing Tips: Results Subsections

The purpose of a Results section is to present, without interpretation, the key results of your research. Your paper does not need to include every result you obtained during your experiments. Results are “key” when they are relevant to addressing the research questions or hypotheses presented at the beginning of your paper.

We use the Results subsections to show the reader what types of outcomes they can expect when using the methodology that we present. In our papers, we write a “Methods Overview” as the first subsection of the Results section. (We discuss writing the “Methods Overview” subsection in a previous writing tips post.) Remaining subsections in your paper’s Results section present your findings in the form of text, figures, and tables.

Each Results subsection should make a specific point, and the subsection heading should be a succinct description of this message. Effective subsection headings make a statement that tells the reader what the method is capable of doing or what types of data the method can be applied to. For example, in a recent paper published by our group, the heading of a subsection that demonstrates how a new GWAS approach controls for false positive results is: “Phenotype Imputation Controls Type I Error.”

Here, a two-paragraph Results subsection has a heading that tells the reader which specific type of analysis is discussed, since the paper presents a method that can be applied toward numerous different analytical tasks.

Cell type composition and diversity

We hypothesized that differences in microbial diversity may be linked to whole blood cell type composition. Since the actual cell counts were not available for these individuals, we used cell-proportion estimates derived from available DNA methylation data to test this hypothesis (Houseman et al. 2012; Aryee et al. 2014; Horvath and Levine 2015).

We assessed methylation data from 65 controls from our replication sample, and compared methylation-derived blood cell proportions to alpha diversity after adjusting for age, gender, RIN, and all technical parameters. We tested whether alpha diversity levels are associated with cell type abundance estimates. Our analysis shows one cell type, CD8+ CD28- CD45RA- cells, to be significantly negatively correlated with alpha diversity after correction for all other cell-count estimates (correlation = -0.41, P = 7.3e-4, Figure S6, Table S6). These cells are T cells that lack CD8+ naïve cell markers CD28 and CD45RA and are thought to represent a subpopulation of differentiated CD8+ T cells (Koch et al. 2008; Horvath and Levine 2015). We observed that low alpha diversity correlates with high abundance of this population of T cells.

Mangul, Serghei; Loohuis, Loes Olde M; Ori, Anil; Jospin, Guillaume; Koslicki, David; Yang, Harry Taegyun; Wu, Timothy; Boks, Marco P; Lomen-Hoerth, Catherine; Wiedau-Pazos, Martina; Cantor, Rita; de Vos, Willem M; Kahn, Rene S; Eskin, Eleazar; Ophoff, Roel A

Total RNA Sequencing reveals microbial communities in human blood and disease specific effects.

In: BioRxiv, (057570), 2016.

For each subsection, we include one figure that illustrates the heading’s message. The figure’s legend (also referred to as a “caption”) can simply be the subsection heading with additional information explaining the methods and data involved in the visual output. It may be helpful to select a figure and write a legend before composing text for the subsection.

At this point, you could probably write an entire paper on each figure! In general, we limit the text in each Results subsection to one to two paragraphs. Here, we use the minimum amount of text that is necessary to walk our reader through the figure. Think about what the reader needs to know in order to start using the method for their own analysis. Relevant information includes the type of data used, analytical steps and parameters, and a summary of conclusions. In many cases, the subsection text and figure legend will be repetitive.

The following one-paragraph subsection reports its results in terms of statistical parameters, numerical output, and a supplemental figure. It gives the reader a good idea of what to expect if they want to incorporate this new approach in their own project.

Phenotype Imputation Controls Type I Error

We simulated datasets for multiple phenotypes under the null model where the variant we are testing has no effect (effect size of zero) toward the target phenotype. We computed the type I error under five different significance thresholds: 0.05, 0.01, 0.005, 5 × 10^-6, and 5 × 10^-8. We generated 100,000,000 simulated datasets that consist of 1,000 individuals. The type I error rates for our imputation method were 0.049, 0.0099, 0.00489, 4.90 × 10^-6, and 4.89 × 10^-8 for the significance thresholds of 0.05, 0.01, 0.005, 5 × 10^-6, and 5 × 10^-8, respectively. This indicates that the type I error is correctly controlled in our imputation method. The Northern Finland Birth Cohort dataset 13 was used to show that the type I error is controlled (see Figure S1). We plot the Q-Q plot of the Z-score for the imputed triglyceride (TG) phenotype from the Finland dataset. There is no inflation in the Q-Q plot as shown in Figure S1.

Hormozdiari, Farhad; Kang, Eun Yong; Bilow, Michael; Ben-David, Eyal; Vulpe, Chris; McLachlan, Stela; Lusis, Aldons J; Han, Buhm; Eskin, Eleazar

Imputing Phenotypes for Genome-wide Association Studies.

In: Am J Hum Genet, 99 (1), pp. 89-103, 2016, ISSN: 1537-6605.
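
To make the idea behind this excerpt concrete, here is a minimal sketch of how an empirical type I error rate can be estimated by simulation. It is not the imputation method from the paper: it assumes a single variant, a phenotype that is pure noise, and a simple correlation-based z-test. The function names, the minor allele frequency of 0.3, and the small dataset counts are illustrative assumptions.

```python
import math
import random
from statistics import NormalDist


def simulate_null_pvalue(n_individuals, maf, rng):
    """Return a two-sided p-value for one dataset simulated under the null."""
    # Genotypes: number of minor alleles (0, 1, or 2) per individual.
    genotypes = [(rng.random() < maf) + (rng.random() < maf)
                 for _ in range(n_individuals)]
    # Null model: the phenotype is pure noise, independent of the genotype.
    phenotypes = [rng.gauss(0.0, 1.0) for _ in range(n_individuals)]

    mean_g = sum(genotypes) / n_individuals
    mean_y = sum(phenotypes) / n_individuals
    cov = sum((g - mean_g) * (y - mean_y)
              for g, y in zip(genotypes, phenotypes))
    var_g = sum((g - mean_g) ** 2 for g in genotypes)
    var_y = sum((y - mean_y) ** 2 for y in phenotypes)
    if var_g == 0 or var_y == 0:
        return 1.0  # No variation to test, so this dataset never rejects.

    r = cov / math.sqrt(var_g * var_y)
    z = r * math.sqrt(n_individuals)  # Approximately N(0, 1) under the null.
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


def empirical_type1_error(n_datasets, n_individuals, alpha, maf=0.3, seed=0):
    """Fraction of null datasets whose p-value falls below alpha."""
    rng = random.Random(seed)
    rejections = sum(
        simulate_null_pvalue(n_individuals, maf, rng) < alpha
        for _ in range(n_datasets)
    )
    return rejections / n_datasets
```

With the hypothetical settings `empirical_type1_error(1000, 100, 0.05)`, the returned rate should land near 0.05, which is what “correctly controlled” means in the excerpt. Checking thresholds as small as 5 × 10^-8 requires far more simulated datasets than this sketch uses, which is consistent with the 100,000,000 datasets reported in the excerpt.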


Bonus challenge: After you finish writing your paper, try to remove the sentence highlighting the result’s importance from the Figure caption.

The order in which you present your results can be organized in many different ways. Typically, ordering of subsections is not important for initial manuscripts. One simple approach is to order Results subsections sequentially to support the argument that you are building in your paper.

Here, we present another example of a Results subsection, including the description of a relevant figure. The subsection heading makes it clear to the reader that this part of the paper discusses applying ForestPMPlot, a visualization tool for analyzing meta-analysis studies, to eQTL data.

Application to multi-tissue eQTL analysis

One powerful application of our proposed framework is in multi-tissue eQTL analysis in the Genotype-Tissue Expression (GTEx) project. The GTEx project studies human gene expression and genetic regulation in multiple tissues, providing valuable insights into the mechanisms of gene regulation, which can lead to the new discovery of disease-related perturbations. In this project, genetic variation between individuals will be examined for correlation with differences in gene expression level to identify regions of the genome that influence whether, and by how much, a gene is expressed. In particular, examining multiple tissues can give us valuable insights into the genetic architecture of the regulatory mechanism, because many regulatory regions are known to act in a tissue specific manner (Ernst et al. 2011; Encode Project Consortium 2012). Hence, understanding the role of regulatory variants, and the tissues in which they act, is essential for the functional interpretation of GWAS loci and insights into disease etiology.

Figure 2 is an example of the output of ForestPMPlot for a multitissue eQTL study for the SEMA3B gene (GTEx Consortium 2015). Examining both the forest plot and the PM-Plot allows us to obtain an insight into the tissue-specific genetic effects in eQTL analysis, which leads to the identification of three significant eQTL tissues (heart left ventricle, stomach, and thyroid). This example clearly shows that examining both the forest plot and the PM-Plot allows us to easily hypothesize that there is a specific group of studies showing tissue differences in eQTL analysis.

Kang, Eun Yong; Park, Yurang; Li, Xiao; Segrè, Ayellet V; Han, Buhm; Eskin, Eleazar

ForestPMPlot: A Flexible Tool for Visualizing Heterogeneity between Studies in Meta-analysis.

In: G3 (Bethesda), 6 (7), pp. 1793-8, 2016, ISSN: 2160-1836.


Below, we provide examples of several different types of figures that can illustrate the point of a Results subsection.

Example of a figure and figure caption that clearly illustrate and explain significance of results in a Results subsection (Hormozdiari et al. 2016).

Example of a more complex figure and figure caption in a Results subsection, which aim to explain the advantages of a new visualization tool (Kang et al. 2016).

Example of a general schematic “Methods Overview” subsection figure in the Results section (Mangul et al. 2016).

Writing Tips: Methods Overview

What are the interesting computational ideas underlying a new computational method? What are the intuitions behind the method? How is the method related to other methods? These are the key questions that papers describing new computational methods should answer.

Unfortunately, most papers describing new computational methods don’t explicitly address these questions due to constraints of the journal styles. The Introduction of a methods paper often has only a few sentences about the method. The Methods section typically has many more details but very little discussion of the underlying ideas. Understanding what is interesting about a method is left completely to the reader’s imagination. Often, journals request that the Results section precede the Methods section, which makes the results very difficult to understand unless the reader reads the sections of the paper out of order. Authors can appeal to the journal to have the Methods section first, but this is not a good solution either, since the Methods section contains many details, such as descriptions of the datasets, that take away from the flow of the paper.

To avoid these problems, we make the first subsection of the Results section in our papers a “Methods Overview.” In this section, we describe the method in terms of its high-level ideas and typically include, as a figure, a small example that we use to help the reader understand the method. The goal of this section is to give enough detail that readers can follow the rest of the Results section without having to look at the Methods section. A well-written Methods Overview will also make it much easier for the reader to follow the actual Methods section.

These sections and examples are designed to be self-contained and should be written in language appropriate for a general audience. In fact, some of our blog posts are almost verbatim copies of the Methods Overview sections of some of our recent papers. For example, see the blog posts on GRAT and Genome Reassembly.

Another way to decide what to put in the Methods Overview section is to ask what you would explain in a talk about the method. Often presentations on computational methods have excellent slides showing intuitions and very clear examples. The place to put that kind of material is in the Methods Overview. Remember, in your paper you must give a compelling argument as to WHY your method is interesting. If your readers don’t understand the intuitions underlying your work, they will never appreciate it.

I’m sure you may be asking, “Isn’t this a little redundant?” What I’m proposing here may be a bit repetitive, with a Methods Overview section and a Methods section later in the paper. But they serve different purposes. With a well-written Methods Overview section, a reader can stop after the Results section and understand most of your paper. The Methods section then becomes important only for someone who wants to understand all of the details.

Writing Tips: Introduction

In this blog post, I would like to “introduce” you to our introduction style. Writing the introduction is the most daunting part of the paper writing process, especially for students who are not native English speakers. To help structure the introduction writing process, in our lab we have developed a standard style or template for writing introductions. Since the majority of the papers that we write describe new computational methods, many of our papers naturally fit into this style. We usually publish our papers in genetics journals, which have very high standards of writing and are read by researchers with a wide range of backgrounds. The difference between a paper getting accepted and rejected is often determined by the clarity of the writing.

Our introduction style is a very specific formula that works for us, but obviously there are other ways to structure an introduction, and each experienced writer will have their own style. However, the truth is, you NEVER start out as a good writer, and new writers need to start somewhere. It takes practice, consistency, and effort to write well. If you are a new writer apprehensive about writing an introduction, we hope that this structure can help you.

Our introductions are typically four paragraphs long with each paragraph serving a specific role:
1. Context – First, it is important to explain the context of the research topic. Why is the general topic important? What is happening in the field today that makes this a valid topic of research?
2. Problem – Second, you present the problem. We typically start this paragraph with a “However,” phrase. Simple example: We have this awesome discovery in XYZ… However, using former methods it will take us 10 years to run the data. Each sentence in this paragraph should have a negative tone.
3. Solution – By this point, your readers should sympathize with how terrible this problem is and how there MUST be a solution (maybe a little dramatic, but you get my point). Paragraph three always starts with “In this paper” and a description of what the paper proposes and how it solves the problem presented in the second paragraph.
4. Implication – The last paragraph in your introduction is the implication, which describes why your solution is important and how it moves the field forward. Typically, this paragraph is where you summarize the experimental results and how they demonstrate that the solution solves the problem. This paragraph should answer the reader’s question of “so what?”

An example of the four-paragraph introduction style is in the following paper:

Mangul, Serghei; Wu, Nicholas C; Mancuso, Nicholas; Zelikovsky, Alex; Sun, Ren; Eskin, Eleazar

Accurate viral population assembly from ultra-deep sequencing data.

In: Bioinformatics, 30 (12), pp. i329-i337, 2014, ISSN: 1367-4811.

Most of our other papers do not follow this format exactly in their final form. However, many of them used this template in earlier drafts and then, during the revision process, added a paragraph or two to expand one of the paragraphs in the template. For example, this paper expanded the implication to two paragraphs:

Kang, Eun Yong; Han, Buhm; Furlotte, Nicholas; Joo, Jong Wha J; Shih, Diana; Davis, Richard C; Lusis, Aldons J; Eskin, Eleazar

Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice.

In: PLoS Genet, 10 (1), pp. e1004022, 2014, ISSN: 1553-7404.

and this paper expanded both the context and problem to two paragraphs each:

Sul, Jae Hoon; Han, Buhm; Ye, Chun; Choi, Ted; Eskin, Eleazar

Effectively Identifying eQTLs from Multiple Tissues by Combining Mixed Model and Meta-analytic Approaches.

In: PLoS Genet, 9 (6), pp. e1003491, 2013, ISSN: 1553-7404.

For methods papers, sometimes what we are proposing is an incremental improvement over another solution. In this case, moving from the context to the problem is very difficult without explaining the other solution. For this scenario, we suggest the following six-paragraph structure:
1. Context
2. Problem 1 (the BIG problem)
3. Solution 1 (the previous method)
4. Problem 2 (Why does the previous method fall short?)
5. Solution 2 (“In this paper” you are going to improve Solution 1)
6. Implication

An example of a six-paragraph introduction in which the third and fourth paragraphs were merged is:

Furlotte, Nicholas A; Kang, Eun Yong; Nas, Atila Van; Farber, Charles R; Lusis, Aldons J; Eskin, Eleazar

Increasing Association Mapping Power and Resolution in Mouse Genetic Studies Through the Use of Meta-analysis for Structured Populations.

In: Genetics, 191 (3), pp. 959-67, 2012, ISSN: 1943-2631.

There it is… the beginning to a great paper (at least we like to think so!). Will this work for you? Have other ideas? Let us know in the comments below!